An Improvement of Search Results Access by Designing a Search Engine Result Page with a Clustering Technique

Similar documents
Focused Retrieval Using Topical Language and Structure

The Journal of MacroTrends in Technology and Innovation

Automatic Query Type Identification Based on Click Through Information

ICTNET at Web Track 2010 Diversity Task

Exam IST 441 Spring 2011

University of Amsterdam at INEX 2010: Ad hoc and Book Tracks

Reducing Redundancy with Anchor Text and Spam Priors

Experiments on Related Entity Finding Track at TREC 2009 Qing Yang,Peng Jiang, Chunxia Zhang, Zhendong Niu

University of Twente at the TREC 2008 Enterprise Track: Using the Global Web as an expertise evidence source

HYBRIDIZED MODEL FOR EFFICIENT MATCHING AND DATA PREDICTION IN INFORMATION RETRIEVAL

A RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH

CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL

Deep Web Crawling and Mining for Building Advanced Search Application

Automated Online News Classification with Personalization

Search Engines Information Retrieval in Practice

Microsoft Research Asia at the Web Track of TREC 2009

Company Search When Documents are only Second Class Citizens

RSDC 09: Tag Recommendation Using Keywords and Association Rules

International Journal of Scientific & Engineering Research Volume 2, Issue 12, December ISSN Web Search Engine

TREC-10 Web Track Experiments at MSRA

Verbose Query Reduction by Learning to Rank for Social Book Search Track

Dynamic Visualization of Hubs and Authorities during Web Search

A SURVEY ON WEB FOCUSED INFORMATION EXTRACTION ALGORITHMS

Review on Techniques of Collaborative Tagging

Tilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval

An Enhanced Page Ranking Algorithm Based on Weights and Third level Ranking of the Webpages

Relevance Weighting for Query Independent Evidence

A NEW CLUSTER MERGING ALGORITHM OF SUFFIX TREE CLUSTERING

THE WEB SEARCH ENGINE

Natural Language Processing

A Content-Based Method to Enhance Tag Recommendation

From Passages into Elements in XML Retrieval

Link-Contexts for Ranking

Use of Content Tags in Managing Advertisements for Online Videos

Query Independent Scholarly Article Ranking

Springer Science+ Business, LLC

An Implementation and Analysis on the Effectiveness of Social Search

University of Delaware at Diversity Task of Web Track 2010

Ontology-Based Web Query Classification for Research Paper Searching

Word Disambiguation in Web Search

A Scalable Image Snippet Extraction Framework for Integration with Search Engines

Exam IST 441 Spring 2014

Abstract. 1. Introduction

A New Technique to Optimize User s Browsing Session using Data Mining

A P2P-based Incremental Web Ranking Algorithm

University of Glasgow at TREC2004: Experiments in Web, Robust and Terabyte tracks with Terrier

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

SNUMedinfo at TREC CDS track 2014: Medical case-based retrieval task

Visual Query Suggestion

An Improved Framework for Tag-Based Academic Information Sharing and Recommendation System

ResPubliQA 2010

NUSIS at TREC 2011 Microblog Track: Refining Query Results with Hashtags

Linking Entities in Chinese Queries to Knowledge Graph

Semantic Website Clustering

Exam IST 441 Spring 2013

Exploiting routing information encoded into backlinks to improve topical crawling

Collaborative Filtering using Euclidean Distance in Recommendation Engine

Northeastern University in TREC 2009 Million Query Track

WEB STRUCTURE MINING USING PAGERANK, IMPROVED PAGERANK AN OVERVIEW

Ranking Function Optimizaton Based on OKAPI and K-Means

Inferring User Search for Feedback Sessions

Effective On-Page Optimization for Better Ranking

CS646 (Fall 2016) Homework 1

Evaluating an Associative Browsing Model for Personal Information

ADVANCED LEARNING TO WEB FORUM CRAWLING

Hierarchical Location and Topic Based Query Expansion

Running Head: HOW A SEARCH ENGINE WORKS 1. How a Search Engine Works. Sara Davis INFO Spring Erika Gutierrez.

Chapter 27 Introduction to Information Retrieval and Web Search

AN EFFICIENT COLLECTION METHOD OF OFFICIAL WEBSITES BY ROBOT PROGRAM

IPOMS: an Internet Public Opinion Monitoring System

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Information Retrieval

Tagging tagging. Analysing user keywords in scientific bibliography management systems

Searching the Web [Arasu 01]

I. INTRODUCTION. Fig Taxonomy of approaches to build specialized search engines, as shown in [80].

Time-aware Approaches to Information Retrieval

Measuring Semantic Similarity between Words Using Page Counts and Snippets

HITS and Misses: Combining BM25 with HITS for Expert Search

UMass at TREC 2006: Enterprise Track

Enabling Users to Visually Evaluate the Effectiveness of Different Search Queries or Engines

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web

A New Approach for Automatic Thesaurus Construction and Query Expansion for Document Retrieval

Information Retrieval Issues on the World Wide Web

HFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging

Popularity Weighted Ranking for Academic Digital Libraries

Statistical Language Models for Intelligent XML Retrieval

University of TREC 2009: Indexing half a billion web pages

IMPROVING PERSONALIZED WEB SEARCH USING BOOKSHELF DATA STRUCTURE

Information Retrieval

Estimating Credibility of User Clicks with Mouse Movement and Eye-tracking Information

A Constrained Spreading Activation Approach to Collaborative Filtering

COMPARATIVE ANALYSIS OF POWER METHOD AND GAUSS-SEIDEL METHOD IN PAGERANK COMPUTATION

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines

Department of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _

Domain Specific Search Engine for Students

The Use of Domain Modelling to Improve Performance Over a Query Session

Crawler with Search Engine based Simple Web Application System for Forum Mining

Automated Path Ascend Forum Crawling

Transcription:

An Improvement of Search Results Access by Designing a Search Engine Result Page with a Clustering Technique 60 2 Within-Subjects Design Counter Balancing Learning Effect 1 [1 [2www.worldwidewebsize.com 2560 2561 44,000 1 : ABSTRACT Because of information overloaded problem, search engines become important tools to help WWW users to discover the information they desire. This paper aims at designing a search result page with a clustering technique to improve an efficiency of search results accessby applying it to image searching task. To evaluate the proposed design, sixty graduate students from the department of Web Engineering and Mobile Application Development were recruited to be participants in this research study. They were given two tasks of finding images and were asked to use a prototype of search engine with displaying the results as lists and as clusters. The within-subjects design with a counter balancing technique for eliminating a learning effect was adopts as an experimental research design. The results showed that a search result page with a clustering technique could improve an efficiency of search results access. Keywords: Search Result Page Design, Search Engine, Clustering Technique 1: (Information Overload Problem) [1 (Search Engine) ( Webpage Crawling Process Improvement) (Webpage 1

Indexing Process Improvement) ( Search Results Ranking Mechanism Improvement) (Search Engine Result Page) (Clustering Technique) 2) 2.1) (Web Crawler) (Index Engine) (Search Application) (Ranking Engine) (Evaluation Engine) [3] 2 (Web Crawler) (Document Corpus) (Index Engine) (Search Application) (Index Engine) (Search Application) (Query) (Query Refinement) (Search Results) (Ranking Engine) (Relevant) (Evaluation Engine) (Search Application) 2: 2.2) [1] (Document Title) (Anchor Text) [4-7] (User Usage Log) (Evaluation Engine) (Query) (Search Results List) (Relevant) [8-9] (Social Annotation) 2

(Social Tag) [10-13] Brin and Page [4] Choochaiwattana and Spring [10] Jomsri et al. [14] [15] 12 3) (Clustering Technique) (Prototype of WWW Image Search Engine) 4) 4.1) (Search Results List) 3 (Clustering Technique) 4 3: 4: 4.2) 2560 API Metadata Image Content [3] TF-IDF Ranking Engine) Cosine Similarity Query) 1 3

(Search Engine Prototype) 2 5 5 : Clustering Engine Hierarchical Clustering (Cluster Labeling) 4.3) 60 2 (Query) 2 1 (Learning Effects) Within-Subjects Design Counter Balancing 30 1 2 30 1 2 5) 1 11.15 1: (List) (Clustering Technique) 56.24 45.09 9.28 12.44 (Query Term) 4

(Clustering) 75 6) (Clustering Technique) 2 60 11.15 75 [1], :,, 3, 1, 73 83, 2557 [2] WorldWideWebSize.com The size of the World Wide Web (The Internet). [Online]. Available: http://www.worldwidewebsize.com/. [Accessed: 14-Feb-2018]. [3] B. Croft, D. Metzler, and T. Strohman, Search Engines: Information Retrieval in Practice. Boston: Pearson, 2009. [4] S. Brin and L. Page, The Anatomy of a Large-scale Hypertextual Web Search Engine, in Proceedings of the Seventh International Conference on World Wide Web 7, Amsterdam, The Netherlands, The Netherlands, 1998, pp. 107 117. [5] N. Craswell, D. Hawking, and S. Robertson, Effective Site Finding Using Link Anchor Information, in Proceedings of the 24 th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New York, NY, USA, 2001, pp. 250 257. [6] T. H. W. Westerveld, W. Kraaij, and D. Hiemstra, Retrieving Web Pages using Content, Links, URLs and Anchors, in Proceedings of the Tenth Text retrieval Conference (TREC 2001), Gaithersburg, USA, 2001, pp. 663 672 [7] A. Kundu, S. Sett, S. Kumar, S. Sengupta, and S. Chaudhury, Search Engine Indexing Storage Optimization Using Hamming Distance, International Journal of Intelligent Information and Database Systems, vol. 6, no. 2, pp. 113 128, Mar. 2012. [8] G. Xue, H. Zeng, Z. Chen, T. Yu, W. Ma, W. Xi, and W. Fan, Optimizing Web Search Using Web Click-Through Data, Proceedings of the 13 th ACM Conference on Information and Knowledge Management, Washington DC, USA, 2004, pp.118-126. [9] F. Ahmadi-Abkenari and A. Selamat, Advantages of Employing Log Rank Web Page Importance Metric in Domain Specific Web Search Engines, International Journal of Digital Content Technology and its Application, vol. 7, no. 9, pp. 425-432, 2013. [10] W. Choochaiwattana and M. Spring, Applying Social Annotations to Retrieve and Re-rank Web Resources, Proceedings of the International Conference on Information Management and Engineering, Kuala Lumpur, Malaysia, 2009, pp. 215-219. [11] P. Dmitriev, N. Eiron, M. Fontoura, and E.Shekita, Using Annotations in Enterprise Search, Proceedings of the 15 th International World Wide Web Conference, Edinburgh, Scotland, 2006, pp. 811-817. [12] C. Marlow, M. Naaman, D. Boyd, and A. Davis, Tagging Paper, Taxonomy, Flickr, Academic Article, To Read, Proceedings of the 17 th ACM Conference on Hypertext and Hypermedia, Odense, Denmark, 2006, pp. 31-40. 5

[13] D. Zhou, J. Bian, S. Zheng, H. Zha, and C.L. Giles, Exploring Social Annotations for Information Retrieval, Proceedings of the 17 th International World Wide Web Conference, Beijing, China, 2008, pp. 715-724. [14] P. Jomsri, S. Sanguansintukul, and W. Choochaiwattana, CiteRank: Combination Similarity and Static Ranking with Research Paper Searching, International Journal of Internet Technology and Secured Transactions, vol. 3, no. 2, pp. 161-177, 2011. [15],,,, 18, 1, 89 98, 2551 6