Using Text Elements by Context to Display Search Results in Information Retrieval Systems Model and Research results

Size: px
Start display at page:

Download "Using Text Elements by Context to Display Search Results in Information Retrieval Systems Model and Research results"

Transcription

1 Using Text Elements by Context to Display Search Results in Information Retrieval Systems Model and Research results Offer Drori SHAAM Information Systems The Hebrew University of Jerusalem {shaam.gov.il, cs.huji.ac.il} Tel Fax Abstract Information retrieval systems display search results by various methods. This paper focuses on a model for displaying a list of search results by means of textual elements that utilize a new information unit that replaces the currently used information unit. The paper includes a short description of several studies that support the model. 1. Introduction Because of the growth in the number and scope of global databases, a special approach is required to locating information, from the perspective of the user interface. The Internet, as it exists today, is an outstanding example of a broad-base, unfocused database. Most Internet search engines display their information as a serially ordered list of results (with a partial attempt at ranking the results). In most cases, this list includes the document title, URL and, at times, the first few lines of the document. The information, as currently displayed to the user, is incomplete and insufficiently focused on the search query. This requires the user to actually read all the documents in the list with being able to discriminate. With today s search engines most of the search transactions yield a list of hundreds and even thousands of documents, while studies show that the average user only looks at the first 1-2 results (Kirsch 1998). Finding a solution to this paradox presents a serious challenge to researchers in the field. This paper will suggest a way to locate the relevant document without having to read the listed documents. 2. A model for displaying textual search results To deal with the challenge presented above, this section will define a hierarchical structure containing three levels for displaying search results (see Figure No. 1). Search results can be displayed from textual databases by relying on two basic principles; visualization of the results, and the use of textual components to design the list of results. This article focuses solely on the use of textual components to display search results, where the textual component consists of two categories: internal document information and external document information. 2.1 Results based on internal document information In this category, a number of techniques are used, most of which include information components related to the search topic. Following is a description of the various methods. Significant sentences Significant sentences can be descriptive sentences based on defined paragraphs in the document, for example: Abstract, Introduction, Conclusion. Alternatively, sentences relevant to the search query can be 1

2 used, which include the terms that were the reason for the document being chosen (Luhn 196), (Drori 1998). Significant words Significant words in the document are intrinsic descriptives, such as keywords or frequently repeated words. The document s author determines keywords, or they can be produced automatically. Frequently repeated words that are computer generated (including Stop List operation) can yield results that are similar but less exact (Baldonado & Winograd 1997). Information from HTML tags The language tags can provide us with information about the document. For example, paragraph or subtitle headings can be located by using the <H> tags, and can even be used to generate a table of contents. <META> tags contain information about the document as recorded by the document author, such as: abstract, keywords, and others. A certain amount of noise must be taken into account with these tags because of commercial rating considerations. The following studies utilized tables of contents: Egan et al 1989, Chimera & Shneiderman 1994, and Hertzum & Frokjaer Additional information Additional information can be generated from within the actual document; for example, when a document includes citations from other documents, the titles of the cited documents can be used, assuming that they have a subject in common (Pitkow & Pirolli 1997). Displaying Search Results Techniques Displaying Search Results Textual Techniques Graphic Techniques Internal Document Information External Document Information Significant Sentences Significant Words Information from HTML Tags Additional Information Document Classification Cited Documents Information from the Data Base Figure No. 1 - Model for displaying search results 2.2 Results based on external document information This category utilizes a number of methods that include information components based on the document s subject field and not contained within the actual document. A description of these methods follows. Document classification This method displays the category with which the document is associated. Search engines that manually define document categories (such as Yahoo) can be used for this purpose. It is also possible to create categories with the aid of computerized algorithms, and the subject association of the document can be established by clustering all the search results (Allen et al 1993), (Zamir & Etzioni 1999). 2

3 Citing documents This refers to a situation in which one document cites another, where both have a subject in common. The citing documents can be located directly via the Internet, or by using a subject-oriented database such as the Science Citation Index. When the citing documents are located, either their titles or, alternatively, their cited paragraphs can be used (Amento et al 1999). Information from the database The database in which the document is located can provide an indication of the document s subject in a number of ways. Subject oriented databases usually specify the database subject field. An attempt can be made to determine the database subject field from the titles of additional documents contained in the database. 3. Research into displaying search results The research objective was to locate those information components in the search result display that are most relevant to the user, in order to make the task of locating information both more efficient and more effective. The research questions were: 1. What are the most important information details for display in the search results? 2. When comparing the various methods of displaying search result information, which method is preferable in terms of accomplishing the user s task? The research agenda included the performance of various search tasks by a number of user groups. The tasks were carried out using various interfaces. For research purposes, a database was created that included response documents (in English) for defined search queries. In all the studies 3-4 different interfaces were used. The effectiveness of the search, and user satisfaction, were checked using two dimensions: Objective data: response time and accuracy. Subjective data: convenience, sense of confidence, satisfaction, and the relevancy of information components. The participants in the research came from several groups and included students from the School of Business Administration (MBA), technical support personnel from the computer field, and information specialists and librarians from the information field. Statistical analyses included the Anova Test for examining the significance of the difference in the methods, the P Test to determine the significance of the results and, naturally, standard statistical analyses of averages, standard deviations, etc. In the initial study, 128 participants worked with 3 different interfaces. The subject examined was the contribution made by displaying lines of information from the document in addition to the title. The interfaces were: T - titles only; TFL - titles + first lines from the document head (refers to internal document information/significant sentences/descriptive sentences of the model in figure 1); TLC - titles + lines by search context (refers to internal document information/significant sentences/ sentences relevant to the search query of the model). 3

4 Research 1 - The difference between the methods T TFL TLC Time (sec.) Difficult tasks Simple tasks Comfort Confidence Relevancy T TFL TLC methods A significant difference was found between the methods (P<.1). The TLC method (displaying lines by search context) was preferable in all aspects of the subjective dimension (search convenience, feeling of confidence during use, and relevancy of information). For the objective dimension of search time, the TLC method had an advantage (31%) in the case of complicated tasks, while the other methods had an advantage when handling simple or moderately complex tasks. Research 1 Snap shot TLC interface ( Titles in blue, Lines in context in black, Search terms in blue) 4

5 In the second study, 51 participants worked with 3 different interfaces. The subject checked was the contribution made by displaying keywords for the information items that were displayed in the first study. Keywords refers to internal document information/significant words/intrinsic descriptive of the model in figure 1. The interfaces were: TK - titles + keywords; TFLK - titles + first lines from the document head + keywords; TLCK - titles + lines by search context + keywords. Research 2 Snap shot TLCK interface ( Titles in blue, Key words in green, Lines in context in black, Search terms in red) A significant difference was found between the methods (P<.1). The TLCK method (lines by search context + keywords) is preferable for the subjective dimension. The TLCK method possesses an advantage (33%) over the other methods, in the case of search times for moderate and difficult tasks. 5

6 Research 2 - The differences between the methods Comfort Confidence Relevancy TK TFLK TLCK Time (sec.) TK TFLK TLCK Methods Difficult tasks Simple tasks A significant difference was found between the methods (P<.1). The TLCK method (lines by search context + keywords) is preferable for the subjective dimension. The TLCK method possesses an advantage (33%) over the other methods, in the case of search times for moderate and difficult tasks. In the third study, 75 participants worked with 4 interfaces. The subject checked was the contribution made by displaying the document category in addition to lines from the document (category refers to external document information/documents classification of the model in figure 1). The interfaces were: TFL titles + first lines; TFLC titles + first lines + categories; TLC titles + lines by search context; TLCC titles + lines by search context + categories. Research 3 - The differences between the methods Comfort Confidence Relevancy TFL TFLC TLC TLCC Time (sec.) TFL TFLC TLC TLCC Methods Difficult tasks Simple tasks A significant difference was found between the methods (P<.1). The TLCC method possesses an advantage in the subjective dimension and also in search times (67%) across all task difficulty levels. In this study, we also examined which search results display parameters are important to the user. The findings showed that confidence in the answer is the most important parameter (78%), followed by search time (73%) and then the ability to find the answer without reading all the documents (54%). User convenience was found to be a less important parameter for the search process (44%). 6

7 Research 3 Snap shot TLCC interface ( Titles in blue, Categories in red, Lines in context in black) In the forth study, 61 participants worked with 4 interfaces. The subject checked was the contribution made by displaying the document category, address, common words and the organization that published the paper in addition to lines from the document (category refers to external document information/documents classification of the model in figure 1. Document address refers to external document information/information from the data base. Common words refers to internal document information/significant words and the organization that published the paper refers to external document information/information from the data base). The interfaces were: TLCC titles + lines by search context + categories; TLCA titles + lines by search context +internet address; TLCCW titles + lines by search context + common words; TLCO titles + lines by search context + organization name. 7

8 Research 4 - The differences between the methods % (-high; -low) Comfort Confidence Relevancy TLCC TLCA TLCCW TLCO Time (min.) TLCC TLCA TLCCW TLCO Methods Difficult tasks Simple tasks A significant difference was found between some of the methods (P<.1). The TLCCW method possesses an advantage in the subjective dimension and also in search times across all task difficulty levels. In this study, we also examined which search results display parameters are important to the user. The findings showed that the ability to find the answer without reading all the documents is the most important parameter (87%), followed by confidence (77%) and than search time (67%). User convenience was found to be a less important parameter for the search process (51%). 8

9 Research 4 Snap shot TLCCW interface ( Titles in blue, Common words in red, Lines in context in black, Search terms in bold) 4. Conclusion and findings The objective of the studies was to examine some of the components of the search results display model. The studies that were performed enable the definition of a new information unit that can replace the unit currently used. We found that, in addition to the title, the alternative information unit must include lines by search context, keywords, and an indication of the document category. Document category can be accomplished by common words. Authors of article and database administrators can benefit by including the suggested information components in each document using standardized means (such as XML). An interesting finding was revealed in the feedback on the parameters that the users considered important when using data retrieval systems. The feeling of confidence when using a system is perceived as having a higher priority than speed and locating the answer without having to read the list of documents. On the other hand, users assigned ease of use a low priority. A study planned by us will include an in-depth evaluation of additional information components of the model. Acknowledgements I wish to thank Eliezer Lozinski for his excellent suggestions in the course of the research. My thanks also to Nir Alon and Aliza Weisberg for their assistance in gathering the data in Research 3 and to Liora Halevi and Yifat Betser-Nahum for their assistance in gathering the data in Research 4. 9

10 References 1. Allen, R., Obry, P., Littman, M., An interface for navigating clustered document sets returned by queries, Proceedings of the ACM Conference on Organizational Computing Systems (COOCS93), 1993, Amento, B., Hill, W., Terveen, L., Hix, D., Ju, P., An empirical Evaluation of User interfaces for Topic Management of Web Sites, Proceedings of CHI'99, ACM Press, Pittsburg PA, May 1999, Baldonado, M., Winograd, T., SenseMaker: an information-exploration interface supporting the contextual evolution of a user's interests, Proceedings of CHI97, 1997, Chimera, R., Shneiderman, B., An Exploratory Evaluation of Three Interfaces for Browsing Large Hirarchical Tables of Contents, ACM Transaction in Information Systems, New York: ACM, 12 (4), October 1994, Drori, O., The User Interface in Text Retrieval Systems, SIGCHI bulletin, New York: ACM, July 1998, 3 (3), Egan, D., Remde, J., Landauer, T., Lochbaum, C., Gomez, L., Behavioral Evaluation and Analysis of a Hypertext Browser, Proceedings of CHI '89, New York: ACM, 1989, Hertzum, M., Frokjaer, E., Browsing and Querying in Online Documentation: A Study of User Interface and the Interaction Process, ACM Transaction on Computer-Human Interection, New York: ACM, 3 (2), June 1996, Kirsh, S., Infoseek's experiences searching the Internet, SIGIR Forum, New York: ACM, Vol. 32, 1998, Num. 2, Luhn, H., Keyword in Context Index for Technical Literature, American Documentation, XI (4), 196, Pitkow, J., Pirolli, P., Life, Death, and Lawfulness on the Electronic frontier, CHI '97 Proceedings, New York: ACM, April 1996, Zamir, O., Etzioni, O., Grouper: A Dynamic Clustering Interface to Web Search Results, WWW8 Proceedings, Toronto: WWW,

A World Wide Web-based HCI-library Designed for Interaction Studies

A World Wide Web-based HCI-library Designed for Interaction Studies A World Wide Web-based HCI-library Designed for Interaction Studies Ketil Perstrup, Erik Frøkjær, Maria Konstantinovitz, Thorbjørn Konstantinovitz, Flemming S. Sørensen, Jytte Varming Department of Computing,

More information

Using Clusters on the Vivisimo Web Search Engine

Using Clusters on the Vivisimo Web Search Engine Using Clusters on the Vivisimo Web Search Engine Sherry Koshman and Amanda Spink School of Information Sciences University of Pittsburgh 135 N. Bellefield Ave., Pittsburgh, PA 15237 skoshman@sis.pitt.edu,

More information

Using Text Analysis to Inform Clients of the Subject of a Document

Using Text Analysis to Inform Clients of the Subject of a Document Informing Science InSITE - Where Parallels Intersect June 2003 of the Subject of a Document Offer Drori Hebrew University of Jerusalem and SHAAM Information Systems, Jerusalem, Israel offerd@cs.huji.ac.il

More information

Domain Specific Search Engine for Students

Domain Specific Search Engine for Students Domain Specific Search Engine for Students Domain Specific Search Engine for Students Wai Yuen Tang The Department of Computer Science City University of Hong Kong, Hong Kong wytang@cs.cityu.edu.hk Lam

More information

AN OVERVIEW OF SEARCHING AND DISCOVERING WEB BASED INFORMATION RESOURCES

AN OVERVIEW OF SEARCHING AND DISCOVERING WEB BASED INFORMATION RESOURCES Journal of Defense Resources Management No. 1 (1) / 2010 AN OVERVIEW OF SEARCHING AND DISCOVERING Cezar VASILESCU Regional Department of Defense Resources Management Studies Abstract: The Internet becomes

More information

An Empirical Evaluation of User Interfaces for Topic Management of Web Sites

An Empirical Evaluation of User Interfaces for Topic Management of Web Sites An Empirical Evaluation of User Interfaces for Topic Management of Web Sites Brian Amento 1,2, Will Hill 1, Loren Terveen 1, Deborah Hix 2, and Peter Ju 1 1 AT&T Labs - Research 180 Park Avenue, P.O. Box

More information

Shared Sensemaking: Enhancing the Value of Collaborative Web Search Tools

Shared Sensemaking: Enhancing the Value of Collaborative Web Search Tools Shared Sensemaking: Enhancing the Value of Collaborative Web Search Tools Meredith Ringel Morris Microsoft Research Redmond, WA, USA merrie@microsoft.com Saleema Amershi University of Washington Seattle,

More information

How to use the SRI Research Network s Zotero-based Library

How to use the SRI Research Network s Zotero-based Library How to use the SRI Research Network s Zotero-based Library Welcome to the SRI Research Network s database for scientific articles about the System of Rice Intensification (SRI). The instructions below

More information

Information Gathering Support Interface by the Overview Presentation of Web Search Results

Information Gathering Support Interface by the Overview Presentation of Web Search Results Information Gathering Support Interface by the Overview Presentation of Web Search Results Takumi Kobayashi Kazuo Misue Buntarou Shizuki Jiro Tanaka Graduate School of Systems and Information Engineering

More information

Context Based Web Indexing For Semantic Web

Context Based Web Indexing For Semantic Web IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 12, Issue 4 (Jul. - Aug. 2013), PP 89-93 Anchal Jain 1 Nidhi Tyagi 2 Lecturer(JPIEAS) Asst. Professor(SHOBHIT

More information

Optimizing Search by Showing Results In Context

Optimizing Search by Showing Results In Context Optimizing Search by Showing Results In Context Susan Dumais and Edward Cutrell Microsoft Research One Microsoft Way Redmond, WA 98052 [sdumais cutrell]@microsoft.com ABSTRACT We developed and evaluated

More information

Knowledge enrichment through dynamic annotation

Knowledge enrichment through dynamic annotation Knowledge enrichment through dynamic annotation Abstract This paper describes a technique for interceding between users and the information that they browse. This facility, that we term dynamic annotation,

More information

Evolution of the Walden s Paths Authoring Tools

Evolution of the Walden s Paths Authoring Tools Evolution of the Walden s Paths Authoring Tools Unmil P. Karadkar, Luis Francisco-Revilla, Richard Furuta, Haowei Hsieh, Frank M. Shipman III Center for the Study of Digital Libraries and Department of

More information

An Analysis of Image Retrieval Behavior for Metadata Type and Google Image Database

An Analysis of Image Retrieval Behavior for Metadata Type and Google Image Database An Analysis of Image Retrieval Behavior for Metadata Type and Google Image Database Toru Fukumoto Canon Inc., JAPAN fukumoto.toru@canon.co.jp Abstract: A large number of digital images are stored on the

More information

Organizing Topic-Specific Web Information

Organizing Topic-Specific Web Information Organizing Topic-Specific Web Information Sougata Mukherjea, C&C Research Laboratories, NEC USA Inc, San Jose, Ca, USA ABSTRACT With the explosive growth of the World-Wide Web, it is becoming increasingly

More information

Introduction to Web Clustering

Introduction to Web Clustering Introduction to Web Clustering D. De Cao R. Basili Corso di Web Mining e Retrieval a.a. 2008-9 June 26, 2009 Outline Introduction to Web Clustering Some Web Clustering engines The KeySRC approach Some

More information

How are XML-based Marc21 and Dublin Core Records Indexed and ranked by General Search Engines in Dynamic Online Environments?

How are XML-based Marc21 and Dublin Core Records Indexed and ranked by General Search Engines in Dynamic Online Environments? How are XML-based Marc21 and Dublin Core Records Indexed and ranked by General Search Engines in Dynamic Online Environments? A. Hossein Farajpahlou Professor, Dept. Lib. and Info. Sci., Shahid Chamran

More information

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost

More information

A NEW PERFORMANCE EVALUATION TECHNIQUE FOR WEB INFORMATION RETRIEVAL SYSTEMS

A NEW PERFORMANCE EVALUATION TECHNIQUE FOR WEB INFORMATION RETRIEVAL SYSTEMS A NEW PERFORMANCE EVALUATION TECHNIQUE FOR WEB INFORMATION RETRIEVAL SYSTEMS Fidel Cacheda, Francisco Puentes, Victor Carneiro Department of Information and Communications Technologies, University of A

More information

SSRN PAPER SUBMISSION PROCESS

SSRN PAPER SUBMISSION PROCESS SSRN PAPER SUBMISSION PROCESS Easy Submission Steps Submission Checklist A free SSRN User Account is required to submit a paper. You will also need... Paper title (in English) Date the paper was written

More information

Automated Cognitive Walkthrough for the Web (AutoCWW)

Automated Cognitive Walkthrough for the Web (AutoCWW) CHI 2002 Workshop: Automatically Evaluating the Usability of Web Sites Workshop Date: April 21-22, 2002 Automated Cognitive Walkthrough for the Web (AutoCWW) Position Paper by Marilyn Hughes Blackmon Marilyn

More information

Enhancing E-Journal Access In A Digital Work Environment

Enhancing E-Journal Access In A Digital Work Environment Enhancing e-journal access in a digital work environment Foo, S. (2006). Singapore Journal of Library & Information Management, 34, 31-40. Enhancing E-Journal Access In A Digital Work Environment Schubert

More information

WESTLAW INTERNATIONAL Quick Guide

WESTLAW INTERNATIONAL Quick Guide WESTLAW INTERNATIONAL Quick Guide This Quick Guide contains step-by-step explanations and screen shots to help you use the Westlaw service. It provides an overview and introduces key features that can

More information

Enhanced Visualization for Web-Based Summaries

Enhanced Visualization for Web-Based Summaries Enhanced Visualization for Web-Based Summaries Brent Wenerstrom Computer Eng. and Computer Science Dept. Duthie Center for Engineering Louisville, Kentucky 40292 brent.wenerstrom@louisville.edu Mehmed

More information

A New Technique to Optimize User s Browsing Session using Data Mining

A New Technique to Optimize User s Browsing Session using Data Mining Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,

More information

Query Modifications Patterns During Web Searching

Query Modifications Patterns During Web Searching Bernard J. Jansen The Pennsylvania State University jjansen@ist.psu.edu Query Modifications Patterns During Web Searching Amanda Spink Queensland University of Technology ah.spink@qut.edu.au Bhuva Narayan

More information

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Thaer Samar 1, Alejandro Bellogín 2, and Arjen P. de Vries 1 1 Centrum Wiskunde & Informatica, {samar,arjen}@cwi.nl

More information

Deep Web Content Mining

Deep Web Content Mining Deep Web Content Mining Shohreh Ajoudanian, and Mohammad Davarpanah Jazi Abstract The rapid expansion of the web is causing the constant growth of information, leading to several problems such as increased

More information

Instant Content Creator. User Guide

Instant Content Creator. User Guide Instant Content Creator User Guide Table of contents: 1 INTRODUCTION...4 1.1 Installation Procedure...4 2 INSTANT CONTENT CREATOR INTERFACE...7 3 CREATING A NEW PROJECT...9 4 ENTERING THE NAME OF THE PRODUCT...10

More information

Analysis of Behavior of Parallel Web Browsing: a Case Study

Analysis of Behavior of Parallel Web Browsing: a Case Study Analysis of Behavior of Parallel Web Browsing: a Case Study Salman S Khan Department of Computer Engineering Rajiv Gandhi Institute of Technology, Mumbai, Maharashtra, India Ayush Khemka Department of

More information

Introduction to the Internet and Web

Introduction to the Internet and Web Introduction to the Internet and Web Internet It is the largest network in the world that connects hundreds of thousands of individual networks all over the world. The popular term for the Internet is

More information

Meter Trouble Report PUBLIC. A Guide for Market Participants. Issue 6.0 IMP_GDE_0098

Meter Trouble Report PUBLIC. A Guide for Market Participants. Issue 6.0 IMP_GDE_0098 PUBLIC IMP_GDE_0098 + Meter Trouble Report A Guide for Market Participants Issue 6.0 This document is a guide for market participants to the use of the Meter Trouble Report workflow application. Public

More information

User Guide. Version 1.5 Copyright 2006 by Serials Solutions, All Rights Reserved.

User Guide. Version 1.5 Copyright 2006 by Serials Solutions, All Rights Reserved. User Guide Version 1.5 Copyright 2006 by Serials Solutions, All Rights Reserved. Central Search User Guide Table of Contents Welcome to Central Search... 3 Starting Your Search... 4 Basic Search & Advanced

More information

Quoogle: A Query Expander for Google

Quoogle: A Query Expander for Google Quoogle: A Query Expander for Google Michael Smit Faculty of Computer Science Dalhousie University 6050 University Avenue Halifax, NS B3H 1W5 smit@cs.dal.ca ABSTRACT The query is the fundamental way through

More information

An Empirical Evaluation of User Interfaces for Topic Management of Web Sites

An Empirical Evaluation of User Interfaces for Topic Management of Web Sites An Empirical Evaluation of User Interfaces for Topic Management of Web Sites Brian Amento AT&T Labs - Research 180 Park Avenue, P.O. Box 971 Florham Park, NJ 07932 USA brian@research.att.com ABSTRACT Topic

More information

BUILDING A CONCEPTUAL MODEL OF THE WORLD WIDE WEB FOR VISUALLY IMPAIRED USERS

BUILDING A CONCEPTUAL MODEL OF THE WORLD WIDE WEB FOR VISUALLY IMPAIRED USERS 1 of 7 17/01/2007 10:39 BUILDING A CONCEPTUAL MODEL OF THE WORLD WIDE WEB FOR VISUALLY IMPAIRED USERS Mary Zajicek and Chris Powell School of Computing and Mathematical Sciences Oxford Brookes University,

More information

Visualization of User Eye Movements for Search Result Pages

Visualization of User Eye Movements for Search Result Pages Visualization of User Eye Movements for Search Result Pages Yuka Egusa National Institute for Educational Policy Research 3-2-2 Kasumigaseki, Chiyoda-ku, Tokyo 100-8951, Japan yuka@nier.go.jp Hitoshi Terai

More information

Student Usability Project Recommendations Define Information Architecture for Library Technology

Student Usability Project Recommendations Define Information Architecture for Library Technology Student Usability Project Recommendations Define Information Architecture for Library Technology Erika Rogers, Director, Honors Program, California Polytechnic State University, San Luis Obispo, CA. erogers@calpoly.edu

More information

Automated Online News Classification with Personalization

Automated Online News Classification with Personalization Automated Online News Classification with Personalization Chee-Hong Chan Aixin Sun Ee-Peng Lim Center for Advanced Information Systems, Nanyang Technological University Nanyang Avenue, Singapore, 639798

More information

IJMIE Volume 2, Issue 9 ISSN:

IJMIE Volume 2, Issue 9 ISSN: WEB USAGE MINING: LEARNER CENTRIC APPROACH FOR E-BUSINESS APPLICATIONS B. NAVEENA DEVI* Abstract Emerging of web has put forward a great deal of challenges to web researchers for web based information

More information

The State of Website Accessibility in Higher Education

The State of Website Accessibility in Higher Education Association for Information Systems AIS Electronic Library (AISeL) SAIS 2007 Proceedings Southern (SAIS) 3-1-2007 The State of Website Accessibility in Higher Education Kevin S. Floyd kfloyd@mail.maconstate.edu

More information

6367(Print), ISSN (Online) Volume 4, Issue 3, May June (2013), IAEME & TECHNOLOGY (IJCET)

6367(Print), ISSN (Online) Volume 4, Issue 3, May June (2013), IAEME & TECHNOLOGY (IJCET) INTERNATIONAL International Journal of Computer JOURNAL Engineering OF COMPUTER and Technology ENGINEERING (IJCET), ISSN 0976- & TECHNOLOGY (IJCET) ISSN 0976 6367(Print) ISSN 0976 6375(Online) Volume 4,

More information

CCH INCORPORATED 05/03

CCH INCORPORATED 05/03 Button Bar Main Menu Return to the main tab level of the current library without deselecting your targets Preferences Set preferences for searching, document display, and other functions Citation Search

More information

Dynamic Visualization of Hubs and Authorities during Web Search

Dynamic Visualization of Hubs and Authorities during Web Search Dynamic Visualization of Hubs and Authorities during Web Search Richard H. Fowler 1, David Navarro, Wendy A. Lawrence-Fowler, Xusheng Wang Department of Computer Science University of Texas Pan American

More information

The Role of Information Scent in On-line Browsing:

The Role of Information Scent in On-line Browsing: The Role of Information Scent in On-line Browsing: Extensions of the ACT-R Utility and Concept Formation Mechanisms Peter Pirolli User Interface Research Area Supported in part by Office of Naval Research

More information

Link Based Clustering of Web Search Results

Link Based Clustering of Web Search Results Link Based Clustering of Web Search Results Yitong Wang and Masaru Kitsuregawa stitute of dustrial Science, The University of Tokyo {ytwang, kitsure}@tkl.iis.u-tokyo.ac.jp Abstract. With information proliferation

More information

Introduction. What do you know about web in general and web-searching in specific?

Introduction. What do you know about web in general and web-searching in specific? WEB SEARCHING Introduction What do you know about web in general and web-searching in specific? Web World Wide Web (or WWW, It is called a web because the interconnections between documents resemble a

More information

Citations and Bibliographies

Citations and Bibliographies Citations and Bibliographies A bibliography is a list of sources that have been consulted or cited in a document. In Microsoft Word, it is possible to automatically generate a bibliography based on the

More information

Qualitative Data Analysis Software. A workshop for staff & students School of Psychology Makerere University

Qualitative Data Analysis Software. A workshop for staff & students School of Psychology Makerere University Qualitative Data Analysis Software A workshop for staff & students School of Psychology Makerere University (PhD) January 27, 2016 Outline for the workshop CAQDAS NVivo Overview Practice 2 CAQDAS Before

More information

An Introduction to PubMed Searching: A Reference Guide

An Introduction to PubMed Searching: A Reference Guide An Introduction to PubMed Searching: A Reference Guide Created by the Ontario Public Health Libraries Association (OPHLA) ACCESSING PubMed PubMed, the National Library of Medicine s free version of MEDLINE,

More information

Using the Web in Your Teaching

Using the Web in Your Teaching Using the Web in Your Teaching November 16, 2001 Dirk Morrison Extension Division, University of Saskatchewan Workshop Outline What will we cover? Why use the Web for teaching and learning? Planning to

More information

Destiny. Understanding Roles and Assigning Permissions Webinar. Participant Guide

Destiny. Understanding Roles and Assigning Permissions Webinar. Participant Guide Destiny Understanding Roles and Assigning Permissions Webinar Participant Guide 12011A v11.5 Destiny Understanding Roles and Assigning Permissions Webinar Participant Guide Version 11.5 Participant Guide

More information

Appendix A: Scenarios

Appendix A: Scenarios Appendix A: Scenarios Snap-Together Visualization has been used with a variety of data and visualizations that demonstrate its breadth and usefulness. Example applications include: WestGroup case law,

More information

Search Engine Architecture. Hongning Wang

Search Engine Architecture. Hongning Wang Search Engine Architecture Hongning Wang CS@UVa CS@UVa CS4501: Information Retrieval 2 Document Analyzer Classical search engine architecture The Anatomy of a Large-Scale Hypertextual Web Search Engine

More information

Setting up Flickr. Open a web browser such as Internet Explorer and type this url in the address bar.

Setting up Flickr. Open a web browser such as Internet Explorer and type this url in the address bar. Workshop 15 th July 2008 Page 1 http://blog.larkin.net.au/ Setting up Flickr Flickr is a photo sharing tool that has rich functionality and allows the public to share their photographs with others., It

More information

A Parallel Computing Architecture for Information Processing Over the Internet

A Parallel Computing Architecture for Information Processing Over the Internet A Parallel Computing Architecture for Information Processing Over the Internet Wendy A. Lawrence-Fowler, Xiannong Meng, Richard H. Fowler, Zhixiang Chen Department of Computer Science, University of Texas

More information

Supporting World-Wide Web Navigation Through History Mechanisms

Supporting World-Wide Web Navigation Through History Mechanisms Supporting World-Wide Web Navigation Through History Mechanisms Linda Tauscher Computer Science Department, University of Calgary tauscher@cpsc.ucalgary.ca Cite as: Tauscher, L. (1996) Supporting World

More information

Narrowing It Down: Information Retrieval, Supporting Effective Visual Browsing, Semantic Networks

Narrowing It Down: Information Retrieval, Supporting Effective Visual Browsing, Semantic Networks Clarence Chan: clarence@cs.ubc.ca #63765051 CPSC 533 Proposal Memoplex++: An augmentation for Memoplex Browser Introduction Perusal of text documents and articles is a central process of research in many

More information

Chapter 3: Google Penguin, Panda, & Hummingbird

Chapter 3: Google Penguin, Panda, & Hummingbird Chapter 3: Google Penguin, Panda, & Hummingbird Search engine algorithms are based on a simple premise: searchers want an answer to their queries. For any search, there are hundreds or thousands of sites

More information

Introduction to Compendium Tutorial

Introduction to Compendium Tutorial Instructors Simon Buckingham Shum, Anna De Liddo, Michelle Bachler Knowledge Media Institute, Open University UK Tutorial Contents http://compendium.open.ac.uk/institute 1 Course Introduction... 1 2 Compendium

More information

Incorporating Hyperlink Analysis in Web Page Clustering

Incorporating Hyperlink Analysis in Web Page Clustering Incorporating Hyperlink Analysis in Web Page Clustering Michael Chau School of Business The University of Hong Kong Pokfulam, Hong Kong +852 2859-1014 mchau@business.hku.hk Patrick Y. K. Chau School of

More information

Authoritative Sources in a Hyperlinked Environment

Authoritative Sources in a Hyperlinked Environment Authoritative Sources in a Hyperlinked Environment Journal of the ACM 46(1999) Jon Kleinberg, Dept. of Computer Science, Cornell University Introduction Searching on the web is defined as the process of

More information

The Internet The Internet

The Internet The Internet The Internet The Internet is a computer network made up of thousands of networks worldwide. No one knows exactly how many computers are connected to the Internet. It is certain, however, that these number

More information

WEB APPLICATION DEVELOPMENT. How the Web Works

WEB APPLICATION DEVELOPMENT. How the Web Works WEB APPLICATION DEVELOPMENT How the Web Works What Is Hypertext Markup Language? Web pages are created using Hypertext Markup Language (HTML), which is an authoring language used to create documents for

More information

CHECKPOINT CATALYST QUICK REFERENCE CARD

CHECKPOINT CATALYST QUICK REFERENCE CARD CHECKPOINT CATALYST QUICK REFERENCE CARD Logging in to Checkpoint 1. Launch your browser and enter the Checkpoint address in the browser location bar. http://checkpoint.thomsonreuters.com The Checkpoint

More information

ACM Digital Library. LIBRARY SERVICES

ACM Digital Library.  LIBRARY SERVICES ACM Digital Library The ACM Digital Library is a comprehensive database of full-text articles in computing and information technology. It provides access to the complete collection of ACM publications

More information

Competitive Intelligence and Web Mining:

Competitive Intelligence and Web Mining: Competitive Intelligence and Web Mining: Domain Specific Web Spiders American University in Cairo (AUC) CSCE 590: Seminar1 Report Dr. Ahmed Rafea 2 P age Khalid Magdy Salama 3 P age Table of Contents Introduction

More information

CADIAL Search Engine at INEX

CADIAL Search Engine at INEX CADIAL Search Engine at INEX Jure Mijić 1, Marie-Francine Moens 2, and Bojana Dalbelo Bašić 1 1 Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia {jure.mijic,bojana.dalbelo}@fer.hr

More information

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm K.Parimala, Assistant Professor, MCA Department, NMS.S.Vellaichamy Nadar College, Madurai, Dr.V.Palanisamy,

More information

Enhancing Cluster Quality by Using User Browsing Time

Enhancing Cluster Quality by Using User Browsing Time Enhancing Cluster Quality by Using User Browsing Time Rehab M. Duwairi* and Khaleifah Al.jada'** * Department of Computer Information Systems, Jordan University of Science and Technology, Irbid 22110,

More information

2. PRELIMINARIES MANICURE is specically designed to prepare text collections from printed materials for information retrieval applications. In this ca

2. PRELIMINARIES MANICURE is specically designed to prepare text collections from printed materials for information retrieval applications. In this ca The MANICURE Document Processing System Kazem Taghva, Allen Condit, Julie Borsack, John Kilburg, Changshi Wu, and Je Gilbreth Information Science Research Institute University of Nevada, Las Vegas ABSTRACT

More information

I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING. Andrii Donchenko

I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING. Andrii Donchenko International Journal "Information Technologies and Knowledge" Vol.1 / 2007 293 I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING Andrii Donchenko Abstract: This article considers

More information

Adaptive and Personalized System for Semantic Web Mining

Adaptive and Personalized System for Semantic Web Mining Journal of Computational Intelligence in Bioinformatics ISSN 0973-385X Volume 10, Number 1 (2017) pp. 15-22 Research Foundation http://www.rfgindia.com Adaptive and Personalized System for Semantic Web

More information

An Approach To Web Content Mining

An Approach To Web Content Mining An Approach To Web Content Mining Nita Patil, Chhaya Das, Shreya Patanakar, Kshitija Pol Department of Computer Engg. Datta Meghe College of Engineering, Airoli, Navi Mumbai Abstract-With the research

More information

EVALUATION OF SEARCHER PERFORMANCE IN DIGITAL LIBRARIES

EVALUATION OF SEARCHER PERFORMANCE IN DIGITAL LIBRARIES DEFINING SEARCH SUCCESS: EVALUATION OF SEARCHER PERFORMANCE IN DIGITAL LIBRARIES by Barbara M. Wildemuth Associate Professor, School of Information and Library Science University of North Carolina at Chapel

More information

COMPARISON OF FEATURES OF ONLINE JOURNALS AND DATABASES

COMPARISON OF FEATURES OF ONLINE JOURNALS AND DATABASES COMPARISON OF FEATURES OF ONLINE JOURNALS AND DATABASES C.N.D. PUNCHIHEWA 1 BSc (Colombo), MLS (Colombo) Abstract The objective of the study was to examine and make a comparison of the features of online

More information

Enhancing Cluster Quality by Using User Browsing Time

Enhancing Cluster Quality by Using User Browsing Time Enhancing Cluster Quality by Using User Browsing Time Rehab Duwairi Dept. of Computer Information Systems Jordan Univ. of Sc. and Technology Irbid, Jordan rehab@just.edu.jo Khaleifah Al.jada' Dept. of

More information

A Knowledge-Based Approach to Organizing Retrieved Documents

A Knowledge-Based Approach to Organizing Retrieved Documents A Knowledge-Based Approach to Organizing Retrieved Documents Wanda Pratt Information & Computer Science University of California, Irvine Irvine, CA 92697-3425 pratt@ics.uci.edu From: AAAI-99 Proceedings.

More information

Recommendation on the Web Search by Using Co-Occurrence

Recommendation on the Web Search by Using Co-Occurrence Recommendation on the Web Search by Using Co-Occurrence S.Jayabalaji 1, G.Thilagavathy 2, P.Kubendiran 3, V.D.Srihari 4. UG Scholar, Department of Computer science & Engineering, Sree Shakthi Engineering

More information

CIW: JavaScript Specialist. Course Outline. CIW: JavaScript Specialist. 30 Dec

CIW: JavaScript Specialist. Course Outline. CIW: JavaScript Specialist. 30 Dec Course Outline 30 Dec 2018 Contents 1. Course Objective 2. Pre-Assessment 3. Exercises, Quizzes, Flashcards & Glossary Number of Questions 4. Expert Instructor-Led Training 5. ADA Compliant & JAWS Compatible

More information

A Linear Regression Model for Assessing the Ranking of Web Sites Based on Number of Visits

A Linear Regression Model for Assessing the Ranking of Web Sites Based on Number of Visits A Linear Regression Model for Assessing the Ranking of Web Sites Based on Number of Visits Dowming Yeh, Pei-Chen Sun, and Jia-Wen Lee National Kaoshiung Normal University Kaoshiung, Taiwan 802, Republic

More information

Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach

Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach P.T.Shijili 1 P.G Student, Department of CSE, Dr.Nallini Institute of Engineering & Technology, Dharapuram, Tamilnadu, India

More information

ResPubliQA 2010

ResPubliQA 2010 SZTAKI @ ResPubliQA 2010 David Mark Nemeskey Computer and Automation Research Institute, Hungarian Academy of Sciences, Budapest, Hungary (SZTAKI) Abstract. This paper summarizes the results of our first

More information

What is SEO? Search Engine Optimization 101

What is SEO? Search Engine Optimization 101 What is SEO? Search Engine Optimization 101 What is Search Engine Optimization (SEO)? Paid Search Listings SEO is the practice of improving and promoting a website to increase the number of Organic visitors

More information

Information Retrieval Spring Web retrieval

Information Retrieval Spring Web retrieval Information Retrieval Spring 2016 Web retrieval The Web Large Changing fast Public - No control over editing or contents Spam and Advertisement How big is the Web? Practically infinite due to the dynamic

More information

The Effect of Individual Differences on Searching the Web

The Effect of Individual Differences on Searching the Web The Effect of Individual Differences on Searching the Web Stéphane Chaudiron, Madjid Ihadjadene, Daniel Martins To cite this version: Stéphane Chaudiron, Madjid Ihadjadene, Daniel Martins. The Effect of

More information

Document Structure Analysis in Associative Patent Retrieval

Document Structure Analysis in Associative Patent Retrieval Document Structure Analysis in Associative Patent Retrieval Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550,

More information

Association-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications

Association-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications Association-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications Daniel Mican, Nicolae Tomai Babes-Bolyai University, Dept. of Business Information Systems, Str. Theodor

More information

A SURVEY ON WEB FOCUSED INFORMATION EXTRACTION ALGORITHMS

A SURVEY ON WEB FOCUSED INFORMATION EXTRACTION ALGORITHMS INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 A SURVEY ON WEB FOCUSED INFORMATION EXTRACTION ALGORITHMS Satwinder Kaur 1 & Alisha Gupta 2 1 Research Scholar (M.tech

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

Cost. For an explanation of JISC Banding and charging, please go to:

Cost. For an explanation of JISC Banding and charging, please go to: Web of Science provides access to current and retrospective multidisciplinary information from approximately 8,700 of the most prestigious, high impact research journals in the world. Web of Science also

More information

IBE101: Introduction to Information Architecture. Hans Fredrik Nordhaug 2008

IBE101: Introduction to Information Architecture. Hans Fredrik Nordhaug 2008 IBE101: Introduction to Information Architecture Hans Fredrik Nordhaug 2008 Objectives Defining IA Practicing IA User Needs and Behaviors The anatomy of IA Organizations Systems Labelling Systems Navigation

More information

Basics of SEO Published on: 20 September 2017

Basics of SEO Published on: 20 September 2017 Published on: 20 September 2017 DISCLAIMER The data in the tutorials is supposed to be one for reference. We have made sure that maximum errors have been rectified. Inspite of that, we (ECTI and the authors)

More information

WebBeholder: A Revolution in Tracking and Viewing Changes on The Web by Agent Community

WebBeholder: A Revolution in Tracking and Viewing Changes on The Web by Agent Community WebBeholder: A Revolution in Tracking and Viewing Changes on The Web by Agent Community Santi Saeyor Mitsuru Ishizuka Dept. of Information and Communication Engineering, Faculty of Engineering, University

More information

Ravel Law Quick Start Guide

Ravel Law Quick Start Guide Ravel Law Quick Start Guide Table of Contents Start Your Search 2 Search Results: The Visualization Map 4 Case Reading 6 Frequently Asked Questions 11 Appendix: Judge Analytics 13 Start Your Judge Search

More information

The Black Magic of Flash SEO

The Black Magic of Flash SEO The Black Magic of Flash SEO Duane Nickull Sr. Technical Evangelist Adobe Systems July 2008 Speaker bio - Duane Nickull!! Current!! Chair - OASIS SOA Reference Model Technical Committee (OASIS Standard

More information

Understanding the use of Temporal Expressions on Persian Web Search

Understanding the use of Temporal Expressions on Persian Web Search Understanding the use of Temporal Expressions on Persian Web Search Behrooz Mansouri Mohammad Zahedi Ricardo Campos Mojgan Farhoodi Alireza Yari Ricardo Campos TempWeb 2018 @ WWW Lyon, France, Apr 23,

More information

A User Study on Features Supporting Subjective Relevance for Information Retrieval Interfaces

A User Study on Features Supporting Subjective Relevance for Information Retrieval Interfaces A user study on features supporting subjective relevance for information retrieval interfaces Lee, S.S., Theng, Y.L, Goh, H.L.D., & Foo, S. (2006). Proc. 9th International Conference of Asian Digital Libraries

More information

Evaluation and Design Issues of Nordic DC Metadata Creation Tool

Evaluation and Design Issues of Nordic DC Metadata Creation Tool Evaluation and Design Issues of Nordic DC Metadata Creation Tool Preben Hansen SICS Swedish Institute of computer Science Box 1264, SE-164 29 Kista, Sweden preben@sics.se Abstract This paper presents results

More information

RSDC 09: Tag Recommendation Using Keywords and Association Rules

RSDC 09: Tag Recommendation Using Keywords and Association Rules RSDC 09: Tag Recommendation Using Keywords and Association Rules Jian Wang, Liangjie Hong and Brian D. Davison Department of Computer Science and Engineering Lehigh University, Bethlehem, PA 18015 USA

More information