Information Retrieval at the Millenium

Size: px
Start display at page:

Download "Information Retrieval at the Millenium"

Transcription

1 Information Retrieval at the Millenium William Hersh, M.D. Division of Medical Informatics and Outcomes Research Oregon Health Sciences University Portland, OR Information retrieval systems were among the first medical informatics applications, yet their use has changed substantially in this decade with the growth of end-user computers and the Internet. While early challenges revolved around how to increase the amount of information available in electronic form, more recent challenges center on how to manage the growing volume. Traditional information retrieval issues such as how to organize and index information to make it more retrievable as well as how to evaluate the effectiveness of systems are still as pertinent as ever. Introduction Information retrieval (IR) systems that is, systems to catalog and provide information about documents were among the first applications of computers. Their potential for organizing and allowing access to the medical literature was recognized by the National Library of Medicine (NLM), and by 1966, the MEDLINE database was launched [1]. At that time, only specially-trained intermediaries could access MEDLINE, and to do so required mailing a search statement that had a several week turnaround time. As we enter the 21 st century, the access to on-line medical information has changed substantially. A large spectrum of medical information is now available electronically not only journal references, but full-text journal literature, textbooks, image collections, and all sorts of other resources. MEDLINE can be searched instantaneously and for free by anyone in the world with access to the Internet. This paper will review the state of medical IR systems at the millenium. In particular, two questions will be addressed: 1. What is new in the IR marketplace, with a particular focus on how the fruits of research now benefit commercially available content and systems? 2. What is new in IR research, focusing on approaches that demonstrate real-world utility to users? Background This paper assumes a basic familiarity with the tenets of IR. A variety of resources exist for achieving such familiarity: 1. A textbook on medical IR [2]. 2. Well-known general textbooks on IR [3-5]. 3. How-to guides for accessing on-line medical information [6, 7]. This section, however, will provide an abbreviated overview to define the core principles and terminology. The ultimate goal of using an IR system is to retrieve documents containing information. While documents at the millenium may be electronic and contain images, sounds, and other multimedia elements in addition to text, the focus of IR systems is still largely the retrieval of text. In order for the user to retrieve documents, he or she must enter queries requesting documents. Most IR system queries consist of typing text at a keyboard, but queries in the next millenium may be entered by voice or manipulation of graphical icons. In order for queries to be matched to documents, there must be an indexing language, which is a set of descriptors that describe the contents of documents and can be entered by users to retrieve them. A search engine is the computer program that uses the indexing language to match queries and documents for the user. There are two intellectual processes in the IR process, indexing and retrieval. These processes require human intelligence, either to carry them out directly or to develop computer programs that perform them. Indexing is the process of assigning terms from the indexing language to the document, whereas retrieval is the procedure of entering terms to obtain documents or their surrogates. These processes will be discussed next, followed by an overview of the means by which IR systems are evaluated. Indexing There are two general approaches to indexing, which are often labeled human and automated. Human

2 indexing typically consists of the assignment of indexing terms from a controlled vocabulary by a specially-trained indexer. The indexing language consists of the terms from the controlled vocabulary. The largest human indexing operation in the medical domain is performed by the NLM for MEDLINE. Using terms from the Medical Subject Headings (MeSH) vocabulary [8], indexers follow a protocol to assign about 5-10 terms per article [9]. The second major approach to document indexing is automated or word indexing. This approach is done strictly by computer programs that identify and designate all words in the document as indexing terms. Some systems filter out common words that have little retrieval value from a stop list (e.g., the, of). Some systems also perform stemming, whereby plurals and common suffixes are removed from words (e.g., coughs and coughing are reduced to cough). After stop word removal and stemming, the remaining word stems comprise the indexing language in the automated indexing approach. An increasing number of IR systems employ an additional step beyond simple word recognition, which is term weighting. This process, originated by Salton in the 1960s but not adopted widely until the 1990s, is usually coupled with natural language queries and relevance ranking, which will be described below [10]. A variety of weighting measures have been implemented over the years, but the one which has shown the greatest general performance is the IDF*TF scheme. In this approach, terms in documents are given weight based on how frequently they occur in the document (term frequency or TF) and how infrequently they occur in others (inverse document frequency or IDF). Retrieval There are likewise two major approaches to retrieval, Boolean and natural language. Each type of retrieval can be used with either type of indexing term, controlled vocabulary or word. In Boolean searching, the user enters terms and connects them via the Boolean operators AND and OR. The AND operator returns documents that contain all of the specified terms, whereas the OR operator returns those that contain any of them. Natural language retrieval does not use Boolean operators. Instead, the user enters a natural language query statement. This approach is typically coupled with term weighting, which through the techniques pioneered by Salton, allows documents not only be to matched, but also ranked for relevance [10]. In the process of relevance ranking, retrieved documents are sorted by presence and frequency of query terms, or some variation thereof. Evaluation Understanding the effectiveness of IR systems is important not only for researchers but also for those who use and purchase them. All of these individuals must have effective means for knowing how well such systems are suited for their task and how they can be improved. The most commonly used evaluation measures are recall and precision. Recall and precision are based on the notion of documents being relevant to an information need [11]. Recall is the proportion of relevant documents in a collection that are retrieved (sensitivity). Precision is the proportion of retrieved documents in a search that are relevant (positive predictive value). Recall and precision are often measured using a test collection of known queries, documents, and relevance judgments. For systems that perform relevance ranking, a table or graph combining recall and precision can be derived. In this approach, precision is measured at fixed intervals of recall, e.g., 0, 0.1, 0.2, and so forth up to 1.0. This allows aggregate measures combining recall and precision to be developed. The most common aggregate measures used are average precision of the fixed points of recall, or precision at some number fixed of documents retrieved, e.g., 20. These approaches are very easy to use with batch studies that require no user. Some authors have criticized over-reliance on recall and precision as well as the use of test collections in general [12, 13]. It is unclear, for example, whether small differences in recall and precision have any effect on a user s ultimate success at searching. What is new in the IR marketplace? Now that a basic overview of IR has been given, various trends in the IR marketplace can be described. Four trends can be gleaned from an overview of the marketplace: the growth of free resources, the emergence of aggregated and/or synthesized resources, the development and evolution of Web search engines, and the adaptation of techniques from research systems into commercial ones.

3 Free resources A variety of Web-based medical resources are now available without charge. The most notable among these is MEDLINE. In June, 1997, the NLM announced that Web-based MEDLINE would be available for free on its Web site. As two Web-based MEDLINE systems had been developed PubMed ( and Internet Grateful Med (igm.nlm.nih.gov) both were made available without charge. Both systems provide a number of innovative searching features. PubMed provides a very simple, easy-to-use interface. It allows relevance feedback, whereby users can select a retrieved reference and obtain more references with similar MeSH terms. PubMed also implements queries that provide the best evidence for certain types of clinical questions, based on research by Haynes et al. [14]. It also establishes a mechanism to provide direct links to the full text of references when available elsewhere on the Internet. Internet Grateful Med provides more databases than PubMed and implements user feedback via the COACH system [15]. The NLM is not the only health-related government agency to use the Web for dissemination of free information. Others, including the Centers for Disease Control and Prevention ( the Food and Drug Administration ( and the National Cancer Institute ( have also made databases and other information available. A forthcoming clearinghouse of clinical practice guidelines is being developed by the Agency for Health Care Policy Research ( There is also a government resource for consumer health information, HealthFinder ( Most consumer-oriented Web sites have also adopted the approach of providing information for free and supporting themselves by advertising. Most information for health care providers, on the other hand, still requires payment. This is not surprising, since the cost of producing this information is high and the market for it is limited, at least compared with consumer-oriented information. Only a handful of clinical sites have adopted the free-withadvertising model, one of which is Medscape ( Some have expressed concern about the quality of free information on the Web. Silberg et al. have suggested standards for Web-based health information [16], including the presence on all health-related Web sites of each page s: Authorship names, affiliations, and credentials Attribution references, sources, and (where appropriate) copyright Disclosure potential and real conflicts of interest Currency dates content posted and updated Aggregation and synthesis of content Many medical publishers have focused on aggregating content. Most of these products began as CD-ROMs which are now being adapted to the Web. Two well-known products that aggregate medical textbooks are Stat!-Ref (Teton Data, Jackson, WY) and Harrison's Plus (McGraw-Hill, New York, NY). Some newer products that are only Web-based include Primary Care Online (Lippincott, Williams, and Wilkins, Philadelphia, PA) and MDConsult (St. Louis, MO). Another form of aggregated on-line information increasingly available is the linkage of bibliographic references with full text journal articles. Clinicians have always preferred not only aggregated information, but synthesized sources as well. One of the oldest paper-based products, now available on the Web, is the Yearbook Series (Mosby, St. Louis, MO). In recent years, new content has appeared based on the evolution of evidence-based medicine (EBM) [17]. While the initial approach to EBM focused on literature retrieval and its critical appraisal by the clinician, advocates have found that routine searching of the literature by clinicians is impractical and that clinicians have limited skills in these areas. As a result, publishers and others have developed synthesized content based on EBM approaches. One of the first forms of evidence-based synthesized content is the extended structured abstract. The first product to do this was ACP Journal Club (American College of Physicians, Philadelphia, PA), which focused on internal medicine. Another journal, Evidence-Based Medicine (American College of Physicians and British Medical Journal, London, UK), extends this approach to all of health care. A similar product, Evidence-Based Practice, has been developed by Appleton & Lange (Stamford, CT) which features shorter synopses but emphasis on patient-oriented (e.g., mortality, symptom reduction) as opposed to disease-oriented (e.g., test result improvement) evidence. Another emerging form of evidence-based synthesized content is the systematic review. These reviews are different from ordinary review articles,

4 which are not often as comprehensive and methodologically sound as they could be. Systematic reviews are instead based on exhaustive literature review and advanced statistical methods, including meta-analysis [18]. The best known producers of systematic reviews are the Cochrane Collaboration [19] and the Agency for Health Care Policy Research Evidence-Based Practice Centers. An on-line version of EBM resources, Evidence-Based Medicine Reviews, has become available from Ovid Technologies (New York, NY). The goal of the Cochrane Collaboration is to produce a database of systematic reviews on all interventions in health care ( Their work is predicated on the principle that the best means to assess the efficacy of a health care intervention is the randomized controlled trial. Their approach includes an exhaustive search for all trials, including those published in foreign language journals or not published at all. Meta-analysis is used where appropriate to aggregate results from like trials. The reviews are updated as results of new trials become available. There are limitations of the Cochrane approach. From an economic standpoint, it is not clear whether there is a sustainable market for reviews that can fund the infrastructure needed to develop and maintain reviews. There is also a chicken and egg problem in that the Cochrane database does not currently have enough reviews to be clinically useful, hence users will not purchase it. Another limitation is the reliance on volunteer authors of reviews, whom have other constraints on their time competing with the ongoing maintenance of their reviews. Finally, randomized controlled trials and meta-analysis have limitations [20]. Web search engines Unlike MEDLINE and CD-ROM textbooks, the Web is not a single database. Rather, it is a dynamic mass of information that changes every time someone adds, deletes, or changes a page. Therefore there can be no search system that can search the entire Web. Search systems for the Web have taken two forms: Web crawlers and filtering-classifying systems. Web crawlers Web crawlers index the words on Web pages. Starting at a seed site, pages are identified by following the links from these to other pages. There is no discrimination of the information that is indexed; everything encountered is added to the database. Most Web crawlers are queried by natural language searching with relevance ranking of the output. The Search buttons on Netscape and Internet Explorer take users to a page that provides access to most of the Web search engines. Some well-known Web crawlers include AltaVista altavista.digital.com Infoseek Excite HotBot Filtering and classifying systems Some Web search engines take a different approach. They filter information, based on certain criteria such as quality or clinical relevance, and/or use classification schemes, such as MeSH, to provide better indexing. Some systems provide elaborate filtering: Medical Matrix Yahoo Health Others provide both filtering and classifying: CliniWeb Medical World Search In CliniWeb, for example, pages that are written to the level of health care students and above are indexed with terms from a subset of the MeSH vocabulary [21]. Adaptation of research techniques A variety of IR techniques formerly viewed as research are now used widely in commercial systems. For example, most Web search engines use natural language searching with relevance ranking that was pioneered by Salton in the 1960s [10]. Other research techniques that have been adopted include relevance feedback and query augmentation. In relevance feedback, the system adds new search terms to the query based on those present in documents that the user has designated as relevant [22]. The most common approach used on the Web (e.g., PubMed, Excite) is to allow the user to select more documents like this one. These systems add words (Excite) or MeSH terms (PubMed) from relevant documents to the query. Query augmentation is the process of presenting the user with suggested words or phrases from retrieved documents that are not in the original query that the user may add (e.g., [23]). In Excite, the list of suggested terms appears across the top of the page.

5 This approach can also be limited to terms from documents that are designated relevant. What is new in IR research? Despite the growth of commercial IR applications, research into new methods is thriving as well. One of the major events fueling research is the Text Retrieval Conference (TREC). A variety of other research methods are being developed outside of TREC, as are new approaches to evaluating how well systems perform. TREC experiments Organized by the National Institutes for Standards and Technology (NIST), the TREC conference is designed to allow different research groups to work on a common large test collection [24]. While not designed to be a competition, it does allow various groups to compare their techniques with others. The test collection consists of several gigabytes of newswire, computer publications, and government reports, along with a set of real-world queries and relevance judgements regarding which documents are relevant to the queries. A Web site provides more details, including proceedings from past conferences (trec.nist.gov). Logistics of TREC TREC was initiated in 1992 and has been held annually since then. The first six conferences were organized around two major tasks, ad hoc retrieval and routing. The ad hoc task was the typical retrieval task, with queries searching against a database of unknown documents. The goal of the routing task, on the other hand, was to identify new documents based not only on queries but also on documents previously identified as relevant. This was a variation of the relevance feedback task, where systems could use terms from known relevant documents to improve their queries. With the TREC- 7 conference in 1998, routing will be eliminated as a major task. For each task, there is a database of content consisting of about two gigabytes of text (one-half to one million documents) and 50 queries that contain a statement of information need and a definition of what constitutes a relevant document. Each participating group submits two runs consisting of ranked documents for each query. All documents in any group s top 100 documents are submitted to relevance judges, with the remainder assumed not relevant. NIST then prepares recall-precision results for each group. In addition to the major tasks, a number of tracks have developed with focus on specific areas. These include: Interactive Since most of the TREC experiments are system-oriented, this track aims to focus more on the user. Natural language processing (NLP) For groups using NLP techniques, this track provides an opportunity to focus on such approaches. Very large corpus Some groups are interested in issues related to extremely large collections, to which this group has access. Filtering A variant of routing that requires a binary instead of ranked decision on document selection. What has been learned in TREC? While there is some concern that the batch-style experiments of TREC and its focused subject domain limit the generalizability of the results obtained, it does provide a platform for assessing diverse approaches to IR. Furthermore, results from similar approaches implemented across different systems have shown consistency. The approaches from TREC that consistently improve results have been new weighting algorithms, passage retrieval, and query expansion. Each of these techniques improve average precision in batch runs by about 10-20%. They have been implemented in a variety of systems whose underlying approach is different (e.g. [23, 25]). The new weighting techniques showing the most benefit include 2-Poission term weighting [26] and pivoted normalization [27]. Both of these measures normalize document length so that longer documents do not get higher relevance scores solely due to their weight. Passage retrieval was pioneered by Salton about ten years ago [28]. It is based on the premise that for full-text documents, information sought in a query is likely to be in one or more particular parts of a document. Passage retrieval is claimed to reduce linguistic ambiguity. In passage retrieval, full-text documents are broken into passages. Each passage is assigned a weight, with the highest-weighting passage(s) contributing to the documents overall weight. TREC results show that the best passages are overlapping fixed-length windows, not those based on any of the usual document structure (e.g., paragraphs) [29].

6 Query expansion has been tried for many years (e.g., [30]), but it did not appear to show benefit until the TREC experiments [23, 25], perhaps due to individual documents and the collection as a whole being much larger than those used in previous experiments. In query expansion, the top ranking documents are all assumed relevant and used in a relevance feedback process. Unlike relevance feedback, the initial query can be used, with the process requiring no action by the user. The TREC experiments have also identified approaches that do not improve results, at least in the context of these experiments. One of these approaches is NLP. Due to the large database, the traditional complete NLP approach is impractical in this setting. A number of groups have adopted partial NLP approaches that focus on identification of noun phrases and use of thesauri [23, 31]. None of these techniques, however, provide improved performance over the beneficial techniques described above, and they often do worse. Other medically-oriented IR research of note A variety of other important medical IR research findings have emerged in recent years. These studies have demonstrated the benefit of MeSH terms for searching MEDLINE, the value of query expansion in the medical domain, and the demonstration that conventional systems can be enhanced by new techniques. Hersh and Hickam found that users in an internal medicine clinic showed little benefit from some of the advanced retrieval features associated with MeSH indexing (e.g., subheadings, explosions), [32]. However, they did show that having the words from the MeSH terms available in the MEDLINE record to search against improved average precision in batchtype studies over just the title and abstract [33]. Srinivasan has also demonstrated that MeSH terms in MEDLINE records improves searching performance, which can be enhanced even more by query expansion using those terms [34]. A variety of research projects at the NLM have identified promise for improving MEDLINE retrieval. For example, an expert system to improve indexing may assist indexers, making their assignments more consistent [35]. On the retrieval side, the COACH expert system has been implemented within Grateful Med to help users diagnose faulty searches [15]. And of course the UMLS Project has the potential to offer a wider coverage of vocabulary terms for indexing and retrieval [36]. New approaches to evaluation Another important area of IR research attracting increased interest is in evaluation methodology itself. As noted above, a number of investigators have questioned the traditional approach of recall and precision [12, 13]. They have noted that relevance judgements are often inconsistent. It has also been asserted that recall and precision were developed with a library orientation but may not be applicable to other areas where IR systems are used, such as the busy clinical setting. Finally, the most important question concerning IR systems is whether information leads the user to make better decisions or have better outcomes of care. It should be noted that evaluating IR systems, especially in operational settings, can be a very difficult task. Unlike other informatics applications, such as knowledge-based decision support systems, the questions users pose may be diverse and/or vague. Health care decisions are also very complex and the IR system may only play a small or peripheral role in answering the question. Finally, as in all research studies, laboratory approaches control extraneous variables but introduce elements of unreality. A number of new approaches to evaluation have been undertaken. One approach for operational system observations has been the use of surrogate measures for clinical outcomes. Wyatt, for example, advocates using the clinician decision and not the patient outcome, which requires a smaller sample size to detect an impact, since the right decision does not improve every outcome [37]. Another technique, employed in laboratory evaluation, has been to assess a user s ability to complete a task. Hersh has focused on the ability of users to answer clinical questions [38, 39]. Further work is looking at factors associated with successful use of the system. Future directions for IR This decade has seen an explosion in growth of commercial IR systems and research on their use. While the Web provides a common platform to facilitate the use of IR systems, there are a number of issues that still hamper their effective use. Further improvements in content, systems, and their evaluation are essential. There must be continued commercial development as well as no strings

7 attached research to identify best approaches for systems and users. Acknowledgements The author would like to thank librarians Lynetta Sacherek and Andrea Ball as well as graduate student Susan Price, all members of his research staff, for their valuable comments and suggestions. References 1. Miles, W., A History of the National Library of Medicine: The Nation's Treasury of Medical Knowledge. 1982, Bethesda, MD: U.S. Dept. of Health and Human Services. 2. Hersh, W., Information Retrieval: A Health Care Perspective. 1996, New York: Springer-Verlag. 3. Salton, G., Introduction to Modern Information Retrieval. 1983, New York: McGraw-Hill. 4. Pao, M., Concepts of Information Retrieval. 1989, Englewood, CO: Libraries Unlimited. 5. Meadow, C., Text Information Retrieval Systems. 1992, San Diego: Academic Press. 6. Feinglos, S., MEDLINE: A Basic Guide to Searching. 1985, Chicago: Medical Library Association. 7. Albright, R., A Basic Guide to Online Information Systems for Health Care Professionals. 1988, Arlington, VA: Information Resources Press. 8. Lowe, H. and G. Barnett, Understanding and using the medical subject headings (MeSH) vocabulary to perform literature searches. Journal of the American Medical Association, : Bachrach, C. and T. Charen, Selection of MEDLINE contents, the development of its thesaurus, and the indexing process. Medical Informatics, (3): Salton, G., Developments in automatic text retrieval. Science, : Hersh, W. and D. Hickam, How well do physicians use electronic information retrieval systems? A framework for investigation and review of the literature. Journal of the American Medical Association, 1998: in press. 12. Swanson, D., Historical note: Information retrieval and the future of an illusion. Journal of the American Society for Information Science, : Hersh, W., Relevance and retrieval evaluation: perspectives from medicine. Journal of the American Society for Information Science, : Haynes, R., et al., Developing optimal search strategies for detecting clinically sound studies in MEDLINE. Journal of the American Medical Informatics Association, : Kingsland, L., et al., COACH: applying UMLS knowledge sources in an expert searcher environment. Bulletin of the Medical Library Association, : Silberg, W., G. Lundberg, and R. Musacchio, Assessing, controlling, and assuring the quality of medical information on the Internet: caveat lector et viewor - let the reader and viewer beware. Journal of the American Medical Association, : Sackett, D., et al., Evidence-Based Medicine: How to Practice and Teach EBM. 1997, New York: Churchhill Livingstone. 18. Mulrow, C. and D. Cook, eds. Systematic Reviews: Synthesis of Best Evidence for Health Care Decisions. 1998, American College of Physicians: Philadelphia. 19. Bero, L. and D. Rennie, The Cochrane Collaboration: preparing, maintaining, and disseminating systematic reviews of the effects of health care. Journal of the American Medical Association, ( ). 20. Feinstein, A., Meta-analysis: statistical alchemy for the 21st century. Journal of Clinical Epidemiology, : Hersh, W., et al., CliniWeb: managing clinical information on the World Wide Web. Journal of the American Medical Informatics Association, : Salton, G. and C. Buckley, Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science, : Evans, D. and R. Lefferts. Design and evaluation of the CLARIT TREC-2 system. in The Second Text REtrieval Conference (TREC-2) Gaithersburg, MD: NIST: Harman, D., Overview of the second Text REtrieval Conference (TREC-2). Information Processing and Management, : Buckley, C., et al. Automatic query expansion using SMART: TREC 3. in Overview of the Third Text REtrieval

8 Conference (TREC-3) Gaithersburg, MD: NIST: Robertson, S. and S. Walker. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. in Proceedings of the 17th Annual International ACM Special Interest Group in Information Retrieval Dublin: Springer-Verlag: Singhal, A., C. Buckley, and M. Mitra. Pivoted document length normalization. in Proceedings of the 19th Annual International ACM Special Interest Group in Information Retrieval Zurich, Switzerland: ACM Press: Salton, G. and C. Buckley, Global text matching for information retrieval. Science, : Callan, J. Passage level evidence in document retrieval. in Proceedings of the 17th Annual International ACM Special Interest Group in Information Retrieval Dublin: Springer-Verlag: Smeaton, A., The retrieval effects of query expansion on a feedback document retrieval system. The Computer Journal, : Strzalkowski, T., J. Carballo, and M. Marinescu. Natural language information retrieval: TREC-3 report. in Overview of the Third Text REtrieval Conference (TREC- 3) Gaithersburg, MD: NIST: Hersh, W. and D. Hickam, The use of a multi-application computer workstation in a clinical setting. Bulletin of the Medical Library Association, : Hersh, W., et al. OHSUMED: an interactive retrieval evaluation and new large test collection for research. in Proceedings of the 17th Annual International ACM Special Interest Group in Information Retrieval Dublin: Springer-Verlag: Srinivasan, P., Retrieval Feedback in MEDLINE. Journal of the American Medical Informatics Association, : Humphrey, S., Medindex system: medical indexing expert system. Information Processing and Management, : Humphreys, B., et al., The Unified Medical Language System: an informatics research collaboration. Journal of the American Medical Informatics Association, : Wyatt, J. and D. Spiegelhalter. Field trials of medical decision-aids: potential problems and solutions. in Proceedings of the 15th Annual Symposium on Computer Applications in Medical Care Washington, DC: McGraw-Hill: Hersh, W., et al. Towards new measures of information retrieval evaluation. in Proceedings of the 18th Annual Symposium on Computer Applications in Medical Care Washington, DC: Hanley-Belfus: Hersh, W., J. Pentecost, and D. Hickam, A task-oriented approach to information retrieval evaluation. Journal of the American Society for Information Science, :

Maintaining a Catalog of Manually-Indexed, Clinically-Oriented World Wide Web Content

Maintaining a Catalog of Manually-Indexed, Clinically-Oriented World Wide Web Content Maintaining a Catalog of Manually-Indexed, Clinically-Oriented World Wide Web Content William Hersh, M.D., Andrea Ball, M.L.S., Bikram Day, M.S., Mary Masterson, M.P.H., Li Zhang, M.S., Lynetta Sacherek,

More information

Indexing and Retrieving Medical Literature

Indexing and Retrieving Medical Literature Evaluation of SAPHIRE: An Automated Approach to Indexing and Retrieving Medical Literature William Hersh, M.D. David H. Hickam, M.D., M.P.H. Oregon Health Sciences University Portland, Oregon, USA R. Brian

More information

EVIDENCE SEARCHING IN EBM. By: Masoud Mohammadi

EVIDENCE SEARCHING IN EBM. By: Masoud Mohammadi EVIDENCE SEARCHING IN EBM By: Masoud Mohammadi Steps in EBM Auditing the outcome Defining the question or problem Applying the results Searching for the evidence Critically appraising the literature Clinical

More information

Introduction to Ovid. As a Clinical Librarían tool! Masoud Mohammadi Golestan University of Medical Sciences

Introduction to Ovid. As a Clinical Librarían tool! Masoud Mohammadi Golestan University of Medical Sciences Introduction to Ovid As a Clinical Librarían tool! Masoud Mohammadi Golestan University of Medical Sciences Overview Ovid helps researchers, librarians, clinicians, and other healthcare professionals find

More information

Introduction to The Cochrane Library

Introduction to The Cochrane Library Introduction to The Cochrane Library What is The Cochrane Library? A collection of databases that include high quality, independent, reliable evidence from Cochrane and other systematic reviews, clinical

More information

in Evidence-Based Medicine

in Evidence-Based Medicine in Evidence-Based Medicine รศ.นพ. อน ร ธ ภะทรากาญจน Aims of Literature Search To solve clinical problems (EBM) To search for existing knowledge in order to conduct a research To update knowledge 5A of

More information

MICC 2017 Librarians and Evidence- Based Medical Practice: An Ever Closer Union. Michael Fanning, Customer Success - Training Manager 7 th June 2017

MICC 2017 Librarians and Evidence- Based Medical Practice: An Ever Closer Union. Michael Fanning, Customer Success - Training Manager 7 th June 2017 MICC 2017 Librarians and Evidence- Based Medical Practice: An Ever Closer Union Michael Fanning, Customer Success - Training Manager 7 th June 2017 Starting Point Question: What is the role of librarians

More information

Information retrieval (IR)

Information retrieval (IR) Information Retrieval (Search) What is Biomedical & Health Informatics? William Hersh, MD Copyright 2018 Oregon Health & Science University Information retrieval (IR) Field concerned with organization

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS 1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,

More information

Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data

Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data American Journal of Applied Sciences (): -, ISSN -99 Science Publications Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data Ibrahiem M.M. El Emary and Ja'far

More information

THE BIG FLAME A MODEL FOR A UNIVERSAL FULL-TEXT ELECTRONIC LIBRARY OF RESEARCH

THE BIG FLAME A MODEL FOR A UNIVERSAL FULL-TEXT ELECTRONIC LIBRARY OF RESEARCH THE A MODEL FOR A UNIVERSAL FULL-TEXT ELECTRONIC LIBRARY OF RESEARCH David Ball, Chris Spice (Bournemouth University) 1. INTRODUCTION Academic publishing, after total reliance on the printed page since

More information

A TEXT MINER ANALYSIS TO COMPARE INTERNET AND MEDLINE INFORMATION ABOUT ALLERGY MEDICATIONS Chakib Battioui, University of Louisville, Louisville, KY

A TEXT MINER ANALYSIS TO COMPARE INTERNET AND MEDLINE INFORMATION ABOUT ALLERGY MEDICATIONS Chakib Battioui, University of Louisville, Louisville, KY Paper # DM08 A TEXT MINER ANALYSIS TO COMPARE INTERNET AND MEDLINE INFORMATION ABOUT ALLERGY MEDICATIONS Chakib Battioui, University of Louisville, Louisville, KY ABSTRACT Recently, the internet has become

More information

EBP. Accessing the Biomedical Literature for the Best Evidence

EBP. Accessing the Biomedical Literature for the Best Evidence Accessing the Biomedical Literature for the Best Evidence Structuring the search for information and evidence Basic search resources Starting the search EBP Lab / Practice: Simple searches Using PubMed

More information

Sheffield University and the TREC 2004 Genomics Track: Query Expansion Using Synonymous Terms

Sheffield University and the TREC 2004 Genomics Track: Query Expansion Using Synonymous Terms Sheffield University and the TREC 2004 Genomics Track: Query Expansion Using Synonymous Terms Yikun Guo, Henk Harkema, Rob Gaizauskas University of Sheffield, UK {guo, harkema, gaizauskas}@dcs.shef.ac.uk

More information

The impact of patient, intervention, comparison, outcome (PICO) as a search strategy tool on literature search quality: a systematic review

The impact of patient, intervention, comparison, outcome (PICO) as a search strategy tool on literature search quality: a systematic review The impact of patient, intervention, comparison, outcome (PICO) as a search strategy tool on literature search quality: a systematic review Mette Brandt Eriksen, PhD; Tove Faber Frandsen, PhD APPENDIX

More information

EVALUATION OF SEARCHER PERFORMANCE IN DIGITAL LIBRARIES

EVALUATION OF SEARCHER PERFORMANCE IN DIGITAL LIBRARIES DEFINING SEARCH SUCCESS: EVALUATION OF SEARCHER PERFORMANCE IN DIGITAL LIBRARIES by Barbara M. Wildemuth Associate Professor, School of Information and Library Science University of North Carolina at Chapel

More information

FIGURE 1. The updated PubMed format displays the Features bar as file tabs. A default Review limit is applied to all searches of PubMed. Select Englis

FIGURE 1. The updated PubMed format displays the Features bar as file tabs. A default Review limit is applied to all searches of PubMed. Select Englis CONCISE NEW TOOLS AND REVIEW FEATURES OF FOR PUBMED CLINICIANS Clinicians Guide to New Tools and Features of PubMed DENISE M. DUPRAS, MD, PHD, AND JON O. EBBERT, MD, MSC Practicing clinicians need to have

More information

The basics of searching biomedical databases. Francesca Frati, MLIS. Learning Outcomes. At the end of this workshop you will:

The basics of searching biomedical databases. Francesca Frati, MLIS. Learning Outcomes. At the end of this workshop you will: The basics of searching biomedical databases Francesca Frati, MLIS Learning Outcomes At the end of this workshop you will: Be better able to formulate a clear search question Become more familiar with

More information

Go to to discover this essential resource today

Go to   to discover this essential resource today Cochrane User guide What is in The Cochrane Library? The Cochrane Library consists of seven databases and is used by a broad range of people interested in Evidence-Based health care, including consumers,

More information

The Cochrane Library. Reference Guide. Trusted evidence. Informed decisions. Better health.

The Cochrane Library. Reference Guide. Trusted evidence. Informed decisions. Better health. The Cochrane Library Reference Guide Trusted evidence. Informed decisions. Better health. www.cochranelibrary.com Did you know? Did you know? Ten tips for getting the most out of the Cochrane Library 1.

More information

Training Guide on. Searching the Cochrane Library-

Training Guide on. Searching the Cochrane Library- Training Guide on Searching the Cochrane Library- www.thecochranelibrary.com November 2012 This guide is prepared by Daphne Grey and Ziba Nadimi, members of Clinical Librarians and Information Skills Trainers

More information

VIDEO SEARCHING AND BROWSING USING VIEWFINDER

VIDEO SEARCHING AND BROWSING USING VIEWFINDER VIDEO SEARCHING AND BROWSING USING VIEWFINDER By Dan E. Albertson Dr. Javed Mostafa John Fieber Ph. D. Student Associate Professor Ph. D. Candidate Information Science Information Science Information Science

More information

Retrieval Evaluation

Retrieval Evaluation Retrieval Evaluation - Reference Collections Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University References: 1. Modern Information Retrieval, Chapter

More information

Beyond the Basics of a Search Kathi Grainger Sr. Training Manager

Beyond the Basics of a Search Kathi Grainger Sr. Training Manager Beyond the Basics of a Search Kathi Grainger Sr. Training Manager 8/2015 Objectives & agenda Literature Searching Overview, questions and scope EBP Content Platform review Searching Steps PICO question?

More information

SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task

SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task Sungbin Choi, Jinwook Choi Medical Informatics Laboratory, Seoul National University, Seoul, Republic of Korea wakeup06@empas.com, jinchoi@snu.ac.kr

More information

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost

More information

A RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH

A RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH A RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH A thesis Submitted to the faculty of the graduate school of the University of Minnesota by Vamshi Krishna Thotempudi In partial fulfillment of the requirements

More information

In the previous lecture we went over the process of building a search. We identified the major concepts of a topic. We used Boolean to define the

In the previous lecture we went over the process of building a search. We identified the major concepts of a topic. We used Boolean to define the In the previous lecture we went over the process of building a search. We identified the major concepts of a topic. We used Boolean to define the relationships between concepts. And we discussed common

More information

Document Structure Analysis in Associative Patent Retrieval

Document Structure Analysis in Associative Patent Retrieval Document Structure Analysis in Associative Patent Retrieval Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550,

More information

The Cochrane Library. Reference Guide. Trusted evidence. Informed decisions. Better health.

The Cochrane Library. Reference Guide. Trusted evidence. Informed decisions. Better health. The Cochrane Library Reference Guide Trusted evidence. Informed decisions. Better health. www.cochranelibrary.com Did you know? Did you know? Ten tips for getting the most out of the Cochrane Library 1.

More information

SciVerse Scopus. 1. Scopus introduction and content coverage. 2. Scopus in comparison with Web of Science. 3. Basic functionalities of Scopus

SciVerse Scopus. 1. Scopus introduction and content coverage. 2. Scopus in comparison with Web of Science. 3. Basic functionalities of Scopus Prepared by: Jawad Sayadi Account Manager, United Kingdom Elsevier BV Radarweg 29 1043 NX Amsterdam The Netherlands J.Sayadi@elsevier.com SciVerse Scopus SciVerse Scopus 1. Scopus introduction and content

More information

How to Apply Basic Principles of Evidence-

How to Apply Basic Principles of Evidence- CSHP 2015 TOOLKIT FROM PAPER TO PR ACTI CE: INCORPORATING EV IDENCE INTO YOUR PAGE 1 PHARMACY PR ACTICE (O BJECTIVE 3.1) How to Apply Basic Principles of Evidence- Based Practice May 2011 How to Do a Basic

More information

The Process of the Literature Review: Searching & Managing the Literature

The Process of the Literature Review: Searching & Managing the Literature The Process of the Literature Review: Searching & Managing the Literature Susan Meadows, MLS Medical Librarian III, Adjunct Asst. Professor Family and Community Medicine University of Missouri - Columbia

More information

INFORMATION RETRIEVAL SYSTEM: CONCEPT AND SCOPE

INFORMATION RETRIEVAL SYSTEM: CONCEPT AND SCOPE 15 : CONCEPT AND SCOPE 15.1 INTRODUCTION Information is communicated or received knowledge concerning a particular fact or circumstance. Retrieval refers to searching through stored information to find

More information

MeSH: A Thesaurus for PubMed

MeSH: A Thesaurus for PubMed Resources and tools for bibliographic research MeSH: A Thesaurus for PubMed October 24, 2012 What is MeSH? Who uses MeSH? Why use MeSH? Searching by using the MeSH Database What is MeSH? Acronym for Medical

More information

How to Find Evidence When You Need It, Part 3: A Clinician s Guide to MEDLINE: Tricks and Special Skills

How to Find Evidence When You Need It, Part 3: A Clinician s Guide to MEDLINE: Tricks and Special Skills How to Find Evidence When You Need It, Part 3: A Clinician s Guide to MEDLINE: Tricks and Special Skills From the Library of The New York Academy of Medicine, * the Augustus Long Health Sciences Library

More information

Acurian on. The Role of Technology in Patient Recruitment

Acurian on. The Role of Technology in Patient Recruitment Acurian on The Role of Technology in Patient Recruitment Wearables smartphones social networks the list of new technological tools available to patients and healthcare providers goes on and on. Many clinical

More information

Okapi in TIPS: The Changing Context of Information Retrieval

Okapi in TIPS: The Changing Context of Information Retrieval Okapi in TIPS: The Changing Context of Information Retrieval Murat Karamuftuoglu, Fabio Venuti Centre for Interactive Systems Research Department of Information Science City University {hmk, fabio}@soi.city.ac.uk

More information

MeSH : A Thesaurus for PubMed

MeSH : A Thesaurus for PubMed Scuola di dottorato di ricerca in Scienze Molecolari Resources and tools for bibliographic research MeSH : A Thesaurus for PubMed What is MeSH? Who uses MeSH? Why use MeSH? Searching by using the MeSH

More information

Searching the Cochrane Library

Searching the Cochrane Library Searching the Cochrane Library To book your place on the course contact the library team: www.epsom-sthelier.nhs.uk/lis E: hirsonlibrary@esth.nhs.uk T: 020 8296 2430 Learning objectives At the end of this

More information

Welcome to the Louis Calder Memorial Library NW 10 Ave., Miami, FL 33136

Welcome to the Louis Calder Memorial Library NW 10 Ave., Miami, FL 33136 Welcome to the Louis Calder Memorial Library 1601 NW 10 Ave., Miami, FL 33136 Calder Library Website http://calder.med.miami.edu Forms Quick links: e-books e-db s e-journals Resource Guides usearch Featured

More information

Finding Nutrition Information on the Web: Coverage vs. Authority

Finding Nutrition Information on the Web: Coverage vs. Authority Finding Nutrition Information on the Web: Coverage vs. Authority Susan G. Doran Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208.Sue_doran@yahoo.com Samuel

More information

Filtering Web Pages for Quality Indicators: An Empirical Approach to Finding High Quality Consumer Health Information on the World Wide Web

Filtering Web Pages for Quality Indicators: An Empirical Approach to Finding High Quality Consumer Health Information on the World Wide Web Filtering Web Pages for Quality Indicators: An Empirical Approach to Finding High Quality Consumer Health Information on the World Wide Web Susan L. Price, MD, MS and William R. Hersh, MD Division of Medical

More information

ABSTRACT INTRODUCTION AND BACKGROUND

ABSTRACT INTRODUCTION AND BACKGROUND The Low Availability of Metadata Elements for Evaluating the Quality of Medical Information on the World Wide Web John Shon MD, Mark A. Musen MD, PhD Stanford Medical Informatics, Stanford University School

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

Relevance of Google Customized Search Engine vs. CISMeF Quality- Controlled Health Gateway

Relevance of Google Customized Search Engine vs. CISMeF Quality- Controlled Health Gateway Relevance of Google Customized Search Engine vs. CISMeF Quality- Controlled Health Gateway Jean-François Gehanno a, Gaétan Kerdelhué a, Saoussen Sakji a, Philippe Massari a, Michel Joubert b, Stéfan J.

More information

Exploring the Query Expansion Methods for Concept Based Representation

Exploring the Query Expansion Methods for Concept Based Representation Exploring the Query Expansion Methods for Concept Based Representation Yue Wang and Hui Fang Department of Electrical and Computer Engineering University of Delaware 140 Evans Hall, Newark, Delaware, 19716,

More information

Performance Measures for Multi-Graded Relevance

Performance Measures for Multi-Graded Relevance Performance Measures for Multi-Graded Relevance Christian Scheel, Andreas Lommatzsch, and Sahin Albayrak Technische Universität Berlin, DAI-Labor, Germany {christian.scheel,andreas.lommatzsch,sahin.albayrak}@dai-labor.de

More information

SciVerse Scopus. Date: 21 Sept, Coen van der Krogt Product Sales Manager Presented by Andrea Kmety

SciVerse Scopus. Date: 21 Sept, Coen van der Krogt Product Sales Manager Presented by Andrea Kmety SciVerse Scopus Date: 21 Sept, 2011 Coen van der Krogt Product Sales Manager Presented by Andrea Kmety Agenda What is SciVerse? SciVerse Scopus at a glance Supporting Researchers Supporting the Performance

More information

CLARIT Compound Queries and Constraint-Controlled Feedback in TREC-5 Ad-Hoc Experiments

CLARIT Compound Queries and Constraint-Controlled Feedback in TREC-5 Ad-Hoc Experiments CLARIT Compound Queries and Constraint-Controlled Feedback in TREC-5 Ad-Hoc Experiments Natasa Milic-Frayling 1, Xiang Tong 2, Chengxiang Zhai 2, David A. Evans 1 1 CLARITECH Corporation 2 Laboratory for

More information

Preliminary Work on Building a User Friendly Adaptive Clinical Documents Repository

Preliminary Work on Building a User Friendly Adaptive Clinical Documents Repository Preliminary Work on Building a User Friendly Adaptive Clinical Documents Repository Enriko Aryanto Stanford University 121 Campus Dr. #3112A Stanford, CA 94305 1-650-497-7306 earyanto@stanford.edu Yang

More information

The Tagging Tangle: Creating a librarian s guide to tagging. Gillian Hanlon, Information Officer Scottish Library & Information Council

The Tagging Tangle: Creating a librarian s guide to tagging. Gillian Hanlon, Information Officer Scottish Library & Information Council The Tagging Tangle: Creating a librarian s guide to tagging Gillian Hanlon, Information Officer Scottish Library & Information Council Introduction Scottish Library and Information Council (SLIC) advisory

More information

Finding and Evaluating Medical Information on the Internet

Finding and Evaluating Medical Information on the Internet Finding and Evaluating Medical Information on the Internet Peter J. Ziemkowski, MD MSU/Kalamazoo Center for Medical Studies Family Medicine Residency Program Caveat Petitor (Let the Seeker Beware) Peter

More information

Finding the evidence: how to get the most from your searching

Finding the evidence: how to get the most from your searching Finding the evidence: how to get the most from your searching Formulate your PICO question See p. 5. Try secondary sources TRIP Database EBM Online Cochrane Library See p. 10. Choose primary database(s)

More information

The Cochrane Library - Full Guide

The Cochrane Library - Full Guide The Cochrane Library - Full Guide What is the Cochrane Library? The Cochrane Library is the most authoritative and up to date source of evidence based information and can help clinicians make decisions

More information

Tilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval

Tilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval Tilburg University Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval Publication date: 2006 Link to publication Citation for published

More information

M erg in g C lassifiers for Im p ro v ed In fo rm a tio n R e triev a l

M erg in g C lassifiers for Im p ro v ed In fo rm a tio n R e triev a l M erg in g C lassifiers for Im p ro v ed In fo rm a tio n R e triev a l Anette Hulth, Lars Asker Dept, of Computer and Systems Sciences Stockholm University [hulthi asker]ø dsv.su.s e Jussi Karlgren Swedish

More information

EBHC International Library The Project

EBHC International Library The Project 4 th International Conference of Evidence-Based Health Care Teachers & Developers Better Evidence for Better Health Care Taormina (Italy), 31 st October - 4 th November, 2007 EBHC International Library

More information

Supporting Discovery in Medicine by Association Rule Mining in Medline and UMLS

Supporting Discovery in Medicine by Association Rule Mining in Medline and UMLS Supporting Discovery in Medicine by Association Rule Mining in Medline and UMLS Dimitar Hristovski a, Janez Stare a, Borut Peterlin b, Saso Dzeroski c a IBMI, Medical Faculty, University of Ljubljana,

More information

SEARCHING FOR EVIDENCE. Intro to Library Resources

SEARCHING FOR EVIDENCE. Intro to Library Resources SEARCHING FOR EVIDENCE Intro to Library Resources What the Library has? Databases: Medline (Ovid or PubMed) largest health sciences database CINAHL largest nursing database Cochrane Library home of the

More information

A step by step Guide to Searching the Cochrane database

A step by step Guide to Searching the Cochrane database A step by step Guide to Searching the Cochrane database www.knowledgenet.ashfordstpeters.nhs.uk/knet/ 2011 KnowledgeNet www.knowledgenet.ashfordstpeters.nhs.uk/knet/ By the end of this presentation you

More information

Information Retrieval Tools for Efficient Data Searching using Big Data

Information Retrieval Tools for Efficient Data Searching using Big Data ISSN: 2393-8528 Contents lists available at www.ijicse.in International Journal of Innovative Computer Science & Engineering Volume 4 Issue 4; July-August-2017; Page No. 06-12 Information Retrieval Tools

More information

formula that gave the highest weight to terms occurring frequently in a small number of documents [10]. In retrieval, the user query was processed to

formula that gave the highest weight to terms occurring frequently in a small number of documents [10]. In retrieval, the user query was processed to The SAPHIRE Server: SAPHIRE is an experimental information retrieval system implemented to test new approaches to automated indexing and retrieval of medical documents. Due to limitations in its original

More information

COCHRANE LIBRARY. Contents

COCHRANE LIBRARY. Contents COCHRANE LIBRARY Contents Introduction... 2 Learning outcomes... 2 About this workbook... 2 1. Getting Started... 3 a. Finding the Cochrane Library... 3 b. Understanding the databases in the Cochrane Library...

More information

Balancing Manual and Automatic Indexing for Retrieval of Paper Abstracts

Balancing Manual and Automatic Indexing for Retrieval of Paper Abstracts Balancing Manual and Automatic Indexing for Retrieval of Paper Abstracts Kwangcheol Shin 1, Sang-Yong Han 1, and Alexander Gelbukh 1,2 1 Computer Science and Engineering Department, Chung-Ang University,

More information

From Passages into Elements in XML Retrieval

From Passages into Elements in XML Retrieval From Passages into Elements in XML Retrieval Kelly Y. Itakura David R. Cheriton School of Computer Science, University of Waterloo 200 Univ. Ave. W. Waterloo, ON, Canada yitakura@cs.uwaterloo.ca Charles

More information

Descriptor and Citation Retrieval in the Medical Behavioral Sciences Literature: Retrieval Overlaps and Novelty Distribution*

Descriptor and Citation Retrieval in the Medical Behavioral Sciences Literature: Retrieval Overlaps and Novelty Distribution* Descriptor and Citation Retrieval in the Medical Behavioral Sciences Literature: Retrieval Overlaps and Novelty Distribution* Katherine W. McCain College of Information Studies, Drexel University, Philadelphia,

More information

How to do a Literature Search? Department of Neonatology, Sanjay Gandhi Postgraduate Institute of Medical Sciences, Lucknow, UP, India

How to do a Literature Search? Department of Neonatology, Sanjay Gandhi Postgraduate Institute of Medical Sciences, Lucknow, UP, India Cronicon OPEN ACCESS EC PAEDIATRICS Short Communication How to do a Literature Search? Namita Mishra, Girish Gupta, Aakash Pandita* and Amit Shukla Department of Neonatology, Sanjay Gandhi Postgraduate

More information

Information Retrieval. CS630 Representing and Accessing Digital Information. What is a Retrieval Model? Basic IR Processes

Information Retrieval. CS630 Representing and Accessing Digital Information. What is a Retrieval Model? Basic IR Processes CS630 Representing and Accessing Digital Information Information Retrieval: Retrieval Models Information Retrieval Basics Data Structures and Access Indexing and Preprocessing Retrieval Models Thorsten

More information

Document Retrieval using Predication Similarity

Document Retrieval using Predication Similarity Document Retrieval using Predication Similarity Kalpa Gunaratna 1 Kno.e.sis Center, Wright State University, Dayton, OH 45435 USA kalpa@knoesis.org Abstract. Document retrieval has been an important research

More information

ResPubliQA 2010

ResPubliQA 2010 SZTAKI @ ResPubliQA 2010 David Mark Nemeskey Computer and Automation Research Institute, Hungarian Academy of Sciences, Budapest, Hungary (SZTAKI) Abstract. This paper summarizes the results of our first

More information

THE MEDLINE database was created and is

THE MEDLINE database was created and is 292 MEDLINE SEARCHES Federiuk ABBREVIATIONS IN MEDLINE SEARCHING The Effect of Abbreviations on MEDLINE Searching CAROL S. FEDERIUK, MD, PHD Abstract. Objective: To determine the effect of the use of abbreviations

More information

Challenges in building systematic search strategies to recognize the complexities of Public Health reviews

Challenges in building systematic search strategies to recognize the complexities of Public Health reviews Review article Noronha, J. A., et al: Systematic search strategies for public Health reviews Challenges in building systematic search strategies to recognize the complexities of Public Health reviews Preethy

More information

This paper studies methods to enhance cross-language retrieval of domain-specific

This paper studies methods to enhance cross-language retrieval of domain-specific Keith A. Gatlin. Enhancing Cross-Language Retrieval of Comparable Corpora Through Thesaurus-Based Translation and Citation Indexing. A master s paper for the M.S. in I.S. degree. April, 2005. 23 pages.

More information

Literature Searching Skills for Clinical Audit

Literature Searching Skills for Clinical Audit Literature Searching Skills for Clinical Audit www.knowledgenet.ashfordstpeters.nhs.uk/knet/ 2014 1 Aims and Objectives Aim To carry out an effective literature search of healthcare databases and other

More information

Controlled Medical Vocabulary in the CPR Generations

Controlled Medical Vocabulary in the CPR Generations Tutorials, B. Hieb, M.D. Research Note 5 November 2003 Controlled Medical Vocabulary in the CPR Generations A CMV capability becomes progressively more important as computer-based patient record systems

More information

Introduction to Ovid Search, Discover, Learn

Introduction to Ovid Search, Discover, Learn Introduction to Ovid Search, Discover, Learn Michael Fanning Customer Success - Training Manager Health Learning, Research and Practice Romanian International Conference for Research and Education Publisher

More information

Evidence-Based Resources in Behavioral Health

Evidence-Based Resources in Behavioral Health Evidence-Based Resources in Behavioral Health Native American Behavioral Health Conference Washington, D.C. July 26, 2018 Diane Cooper National Institutes of Health Audience Survey Thank you! Objectives

More information

Biomedical Digital Libraries

Biomedical Digital Libraries Biomedical Digital Libraries BioMed Central Resource review database: a review Judy F Burnham* Open Access Address: University of South Alabama Biomedical Library, 316 BLB, Mobile, AL 36688, USA Email:

More information

Protecting information across government

Protecting information across government Report by the Comptroller and Auditor General Cabinet Office Protecting information across government HC 625 SESSION 2016-17 14 SEPTEMBER 2016 4 Key facts Protecting information across government Key facts

More information

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND 41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia

More information

INFORMATION RETRIEVAL SYSTEMS: Theory and Implementation

INFORMATION RETRIEVAL SYSTEMS: Theory and Implementation INFORMATION RETRIEVAL SYSTEMS: Theory and Implementation THE KLUWER INTERNATIONAL SERIES ON INFORMATION RETRIEVAL Series Editor W. Bruce Croft University of Massachusetts Amherst, MA 01003 Also in the

More information

RMIT University at TREC 2006: Terabyte Track

RMIT University at TREC 2006: Terabyte Track RMIT University at TREC 2006: Terabyte Track Steven Garcia Falk Scholer Nicholas Lester Milad Shokouhi School of Computer Science and IT RMIT University, GPO Box 2476V Melbourne 3001, Australia 1 Introduction

More information

Multimedia Information Systems

Multimedia Information Systems Multimedia Information Systems Samson Cheung EE 639, Fall 2004 Lecture 6: Text Information Retrieval 1 Digital Video Library Meta-Data Meta-Data Similarity Similarity Search Search Analog Video Archive

More information

Criteria and methods in evaluation of digital libraries: use & usability

Criteria and methods in evaluation of digital libraries: use & usability Criteria and methods in evaluation of digital libraries: use & usability, Ph.D. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States License 1 Evaluating

More information

Classification and retrieval of biomedical literatures: SNUMedinfo at CLEF QA track BioASQ 2014

Classification and retrieval of biomedical literatures: SNUMedinfo at CLEF QA track BioASQ 2014 Classification and retrieval of biomedical literatures: SNUMedinfo at CLEF QA track BioASQ 2014 Sungbin Choi, Jinwook Choi Medical Informatics Laboratory, Seoul National University, Seoul, Republic of

More information

Translating evidence into clinical relevance seminar: Searching the literature

Translating evidence into clinical relevance seminar: Searching the literature Translating evidence into clinical relevance seminar: Searching the literature Kaye Lasserre Subject Librarian, Monash University kaye.lasserre@monash.edu Better research skills = efficient and effective

More information

Cochrane Library guide

Cochrane Library guide Cochrane Library guide Last revised February 2015 Library Leabharlann The Cochrane Library is available free of charge (funded by the Health Research Board) in the Republic of Ireland at www.cochranelibrary.com.

More information

Advanced searching. You can save valuable time by planning your search properly. 1. What information do I need?

Advanced searching. You can save valuable time by planning your search properly. 1. What information do I need? Advanced searching You can save valuable time by planning your search properly. 1. Identify your information needs 2. Identify your keywords and subject headings to construct your search 3. Identify appropriate

More information

Recertification Handbook

Recertification Handbook Recertification Handbook NACAS 3 Boar's Head Lane, Suite B Charlottesville, VA 22903 Phone 434-245-8425 Fax 434-245-8453 nacas.org/casp casp@nacas.org Congratulations on receiving the CASP designation!

More information

The Cochrane Library on Wiley InterScience Quick Reference Guide.

The Cochrane Library on Wiley InterScience Quick Reference Guide. The Cochrane Library on Wiley InterScience Quick Reference Guide www.thecochranelibrary.com 1 The Cochrane Library on Wiley InterScience Quick Reference Guide Cochrane Database of Systematic Reviews (Cochrane

More information

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm K.Parimala, Assistant Professor, MCA Department, NMS.S.Vellaichamy Nadar College, Madurai, Dr.V.Palanisamy,

More information

In the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google,

In the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google, 1 1.1 Introduction In the recent past, the World Wide Web has been witnessing an explosive growth. All the leading web search engines, namely, Google, Yahoo, Askjeeves, etc. are vying with each other to

More information

Building Test Collections. Donna Harman National Institute of Standards and Technology

Building Test Collections. Donna Harman National Institute of Standards and Technology Building Test Collections Donna Harman National Institute of Standards and Technology Cranfield 2 (1962-1966) Goal: learn what makes a good indexing descriptor (4 different types tested at 3 levels of

More information

University of Oklahoma Health Sciences Center. Robert M. Bird Health Sciences Library. Marty Thompson

University of Oklahoma Health Sciences Center. Robert M. Bird Health Sciences Library. Marty Thompson University of Oklahoma Health Sciences Center Robert M. Bird Health Sciences Library Marty Thompson Methods of Contacting BHS Library Home page: http://library.ouhsc.edu Phone Numbers 405 271 2285 800

More information

FDA & Medical Device Cybersecurity

FDA & Medical Device Cybersecurity FDA & Medical Device Cybersecurity Closing Keynote, February 19, 2017 Suzanne B. Schwartz, M.D., MBA Associate Director for Science & Strategic Partnerships Center for Devices and Radiological Health US

More information

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment Shigeo Sugimoto Research Center for Knowledge Communities Graduate School of Library, Information

More information

Managing the literature in systematic reviews. Marshall Dozier

Managing the literature in systematic reviews. Marshall Dozier Managing the literature in systematic reviews Marshall Dozier What are we talking about? What makes a literature review systematic? What s the difference between a systematic review and meta-analysis?

More information

PubMed Basics. Stephanie Friree Outreach and Technology Coordinator NN/LM New England Region (800)

PubMed Basics. Stephanie Friree Outreach and Technology Coordinator NN/LM New England Region (800) PubMed Basics 1 PubMed Basics 2 PubMed Basics 3 PubMed Basics Stephanie Friree Outreach and Technology Coordinator stephanie.friree@umassmed.edu NN/LM New England Region (800) 338-7657 Overview! Introductions!

More information