High Ended Web Search Engine Taxonomy Using Concurrent Processing

Similar documents
Publishing Through Some Sites on the Internet

COURSE OUTLINE Survey of Operating Systems

Degree Branch / Specialization College University CSE SONA COLLEGE OF TECHNOLOGY : ASSISTANT PROFESSOR (SENIOR GRADE) ASSISTANT PROFESSOR

Teaching Object-Oriented Programming Concepts Using Visual Basic.NET

In the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google,

UNIT 2 TOPICS IN COMPUTER SCIENCE. Exploring Computer Science 2

Copyright CAUSE This paper was presented at the 1993 CAUSE Annual Conference held in San Diego, California, December 7-10, and is part of the

Domain Specific Search Engine for Students

Analyzing Working of FP-Growth Algorithm for Frequent Pattern Mining

Operating Systems : Overview

Threads in Operating System

Computer and Information Sciences College / Computer Science Department CS 207 D. Computer Architecture. Lecture 9: Multiprocessors

OPERATING- SYSTEM CONCEPTS

Accounting Software. Collaboration Software

A Java Execution Simulator

Certified Innovation Professional

Information Technology Certified Computer Professional Certification: Should Four-year Institutions Embrace It?

Resume. Techniques. Mail ID: Contact No.: S.No. Position held Organisation From To. AU PG Center, Vizianagaram

Susan Smith ECISD Audio Video Production 1415 South Seventh Ave Edinburg, TX Office Phone: 956/

Illinois State University 2012 Alumni Survey Institution Report

Illinois State University 2014 Alumni Survey Institution Report

Everything Google. Creating a Google Account. Google Search Engine

AC Care Mobile Application

(Refer Slide Time: 1:26)

ADAPTIVE HANDLING OF 3V S OF BIG DATA TO IMPROVE EFFICIENCY USING HETEROGENEOUS CLUSTERS

Sermakani. AM Mobile: : IBM Rational Rose, IBM Websphere Studio Application Developer.

CSC Introduction to Computers and Their Applications

CONTINUING PROFESSIONAL DEVELOPMENT REGULATIONS

REFERENCE GUIDE FOR MANUAL DATA INPUT v1.1

Software Concepts. It is a translator that converts high level language to machine level language.

DOWNLOAD PDF TELEPHONE BILLING SYSTEM PROJECT

IMPLEMENTATION OF CLASSIFICATION ALGORITHMS USING WEKA NAÏVE BAYES CLASSIFIER

Web Systems Staff Intranet Card Sorting. Project Cover Sheet. Library Staff Intranet. UM Library Web Systems

CPLP Recertification Policies... 2 Recertification Categories... 2

Wrapper: An Application for Evaluating Exploratory Searching Outside of the Lab

SYLLABUS. Departmental Syllabus CIST0147. Departmental Syllabus. Departmental Syllabus. Departmental Syllabus. Departmental Syllabus

Lecture 1 Niyaz M. Salih

Introduction CHAPTER. Practice Exercises. 1.1 What are the three main purposes of an operating system? Answer: The three main puropses are:

REQUEST FOR DRI RECERTIFICATION BY CONTINUING EDUCATION UNITS

Requisite Skills and Knowledge for Entry-level IT Auditors

Big Java Late Objects Solution Manual

MS SQL Server DBA Training Course. Table of Contents

General Purpose GPU Programming. Advanced Operating Systems Tutorial 9

OHIO PUBLIC LIBRARIAN CERTIFICATION PROGRAM

Private Subnetting One (PS1), an Algorithm for Private IPv4 Address Allocation

Personal User Guide : adding content to Pure 1

A Modeling and Detection of Deadlock in Early Stage of System using UML

For students entering Part 1 in September 2018 UFCSWIYB

THE CPU SPENDS ALMOST ALL of its time fetching instructions from memory

Computing and Programming

For students entering Part 1 in September 2019 UFCSWIYB

COSC243 Part 2: Operating Systems

As an IIA certified professional, the member is responsible for ensuring that the CPD information reported is accurate.

SYLLABUS. Departmental Syllabus CIST0145. Departmental Syllabus. Departmental Syllabus. Departmental Syllabus. Departmental Syllabus

Operating System Design

Operating Systems: Internals and Design Principles. Chapter 2 Operating System Overview Seventh Edition By William Stallings

Questions answered in this lecture: CS 537 Lecture 19 Threads and Cooperation. What s in a process? Organizing a Process

COMPUTER SCIENCE. Computer Science 123

PRABAVADHI.J. Professional Objective:

Personal Profile. Allied Additional Subordinate Subject studied. Management Information System. Electronic Commerce. Image Processing.

This document/guide contains dated material; always check the ASMC website for the most recent information, policies, and other information.

General Purpose GPU Programming. Advanced Operating Systems Tutorial 7

Searching for Information

NETW3005 Operating Systems Lecture 1: Introduction and history of O/Ss

HUKB at NTCIR-12 IMine-2 task: Utilization of Query Analysis Results and Wikipedia Data for Subtopic Mining

MASTER OF SCIENCE IN COMPUTER SCIENCE

A Framework for the Development of Distributed Interactive Applications

Interactive Campaign Planning for Marketing Analysts

INFS 214: Introduction to Computing

A Hands-on Project approach to Teaching Solid Modeling

ESSENTIALS OF COMPUTER ARCHITECTURE, SECOND EDITION BY DOUGLAS COMER

FACULTY PROFILE. Total Experience : Academic : 7 Years and 3 Months. Degree Branch / Specialization College University

Processes and Threads. Processes: Review

International Journal of Software and Web Sciences (IJSWS) Web service Selection through QoS agent Web service

ESSENTIALS OF COMPUTER ARCHITECTURE, SECOND EDITION BY DOUGLAS COMER

Effective Java Programming Language Guide 2nd Edition By Joshua Bloch

COMP 117: Internet-scale Distributed Systems Lessons from the World Wide Web

Course Design, Representation and Browser for Web Based Education

CPLP and APTD Recertification Policies Table of Contents

Web Annotator. Dale Reed, Sam John Computer Science Department University of Illinois at Chicago Chicago, IL

Le L c e t c ur u e e 7 To T p o i p c i s c t o o b e b e co c v o e v r e ed e Multithreading

Self-tuning ongoing terminology extraction retrained on terminology validation decisions

CS418 Operating Systems

BSc (Honours) Computer Science Curriculum Outline

Higher National Unit specification: general information

The Art of Data Structures Introduction. Richard E Sarkis CSC 162: The Art of Data Structures

CS352 Lecture - Data Models

Chapter 2 Operating System Overview

Handling Multithreading Approach Using Java Nikita Goel, Vijaya Laxmi, Ankur Saxena Amity University Sector-125, Noida UP India

Application for Lead-based Paint Activities Training Program Accreditation

High Performance Computing Prof. Matthew Jacob Department of Computer Science and Automation Indian Institute of Science, Bangalore

PATH OPTIMIZATION ALGORITHM FOR NETWORK PROBLEMS USING JOB SEQUENCING TECHNIQUE

SIMULATION-BASED COMPARISON OF SCHEDULING TECHNIQUES IN MULTIPROGRAMMING OPERATING SYSTEMS ON SINGLE AND MULTI-CORE PROCESSORS *

LECTURE/ STUDY NOTES ON C

INTERNET SEARCH STRATEGY LECTURER: MARK GILLAN

CSC 2405: Computer Systems II

A Disk Head Scheduling Simulator

IASA CONTINUING EDUCATION UNITS

INFS 427: AUTOMATED INFORMATION RETRIEVAL (1 st Semester, 2018/2019)

Next Generation LMS Evaluation

Transcription:

Proceedings of Informing Science & IT Education Conference (InSITE) 2010 High Ended Web Search Engine Taxonomy Using Concurrent Processing Bala Dhandayuthapani Veerasamy Mekelle University, Mekelle, Ethiopia dhanssoft@gmail.com Abstract Everyone around the world is searching for information. In the Internet world people use web search engine to look for certain topics. Basically, search engines are more helpful for everyone to search their needs such as Google, Yahoo, and AltaVista. Sometimes, searching same topic on different search engine will produce different results. Thus, we have a novel on high ended search engine taxonomy to meet more and clear information. This paper proposes to exercise one or more search key and search engines on single web search. Keywords: Concurrent, Multithread, Process, Search engine, Search key, Task. Introduction Processor, program and process are three vital elements of any computer execution. Processor is a hardware device, which executes machine instructions. Program is a sequence of instructions awaiting execution. It is a completely static description of what would happen on computer. Process is a program in execution on a processor. It is an active entity in the system, which executes the algorithm defined by a program. Typically, Tasks are logically discrete section of computational work. A task is typically a program or program-like set of instructions that is executed by a processor. Concurrent programming is just a computer program (Veerasamy, 2010a, 2010b), which involves more than one process. Two events are said to be concurrent if they occur within the same time interval. Two or more tasks executing over the same time interval are said to execute concurrently. Tasks that exist at the same time and perform them in a same time period are concurrent. Concurrent tasks can execute in a single or multiprocessing environment. In a single processing environment, concurrent tasks exist at the same time and execute within the same time period by context switching. In a multiprocessor environment, if enough processors are free, concurrent tasks may execute at the same instant over the same time period. Material published as part of this publication, either on-line or in print, is copyrighted by the Informing Science Institute. Permission to make digital or paper copy of part or all of these works for personal or classroom use is granted without fee provided that the copies are not made or distributed for profit or commercial advantage AND that copies 1) bear this notice in full and 2) give the full citation on the first page. It is permissible to abstract these works so long as credit is given. To copy in all other cases or to republish or to post on a server or to redistribute to lists requires specific permission and payment of a fee. Contact Publisher@InformingScience.org to request redistribution permission. Concurrency techniques are used to allow a computer program to do more work over the same time period or time interval. Rather than designing the program to do one task at a time, the program is broke down into tasks that can be executed concurrently. In some situations, doing more work over the same time period is not the goal, rather sim-

High Ended Web Search Engine Taxonomy plifying the programming solution is the goal. Sometimes, it makes more sense of thinking for the solution through concurrently executed tasks. In the early days of computing, computers were single tasking that is, they ran a single job at a time. The big lumbering machine would start one job, run that job to completion and then start the next job, and so on. When engineers became overly frustrated with these batch oriented systems, they rewrote the programs that ran the machines and thus was born the modern multitasking operating system. Multitasking refers to a computer's ability to perform multiple jobs concurrently. For the most part, modern desktop operating systems like Windows 95 or OS/2 have the ability to run two or more programs at the same time. While you are using Netscape to download a big file, you can be playing solitaire in a different window; both programs are running at the same time. Multithreading is an extension of the multitasking paradigm. But, rather than multiple programs, multithreading involves multiple threads of control within a single program. Not only the operating systems run multiple programs, but also each program can run multiple threads of control within the program. For example, using a web browser, you can print one web page, download another, and fill out a form in a third all at the same time. Usually, a thread is a single sequence of execution within a program. Basically, search engine can search matching details from its own database and the results displayed on web pages. It is inadequate also irrelevant for any users. Hence, high ended web search taxonomy proposes to serve full-fledged search results to the internet users. Search key is the topics, which we about to search on the web. The proposed high ended search engine taxonomy is described in bellow. Methodology There are different ways to classify parallel computers. One of the more widely used classifications, in use since 1966, is called Flynn's Taxonomy. Flynn's taxonomy distinguishes multiprocessor computer architectures according to how they can be classified along the two independent dimensions of instruction and data. Each of these dimensions can have only one of two possible states, are single and multiple. This dimension and states organize four possible classifications (Veerasamy, 2010a, 2010b) according to Flynn s and is shown in Figure 1. S I S D S I M D Single Instruction, Single Data Single Instruction, Multiple Data M I S D M I M D Multiple Instruction, Single Data Multiple Instruction, Multiple Data Figure 1: Flynn's Taxonomy According to dimensions and state of Flynn's taxonomy, we take instruction as Search key and data as search engine. Each of these dimensions can have only one of two possible states, are single and multiple. This dimension and states organize four possible classifications and is shown in Figure 2. These classifications proposed as High Ended Web Search Engine Taxonomy. SSSS SSMS Single Search key Single Search engine Single Search key Multiple Search engine MSSS MSMS Multiple Search key Single Search engine Multiple Search key Multiple Search engine Figure 2: High Ended Web Search Engine Taxonomy 70

Veerasamy SSSS- Single Search key Single Search engine Everyday world, everybody is using this Single Search key Single Search engine (SSSS) taxonomy. In this taxonomy, we are using one Search key that is assigned to one search engine. SSSS will be displaying lot of details as we search, some time these pages may be relevant or may not be relevant to our target. Sometimes, it may waste our time. The following Figure 3 shows the taxonomy of Single Search key Single Search engine. Search key Search engine Figure 3: Single Search key Single Search engine Taxonomy Single Search key Single Search engine illustrates usual search on the web. For instance, Yahoo search engine asks Single Search key from internet users and carry relevant topics from Yahoo s server. The following Figure 4 shows Yahoo search engine. Figure 4: Yahoo Search Engine SSMS- Single Search key Multiple Search engines Single Search key Multiple Search engines (SSMS) taxonomy uses one Search key that is assigned to more than search engine. SSMS proposed to get details from many servers database. This taxonomy is more powerful than SSSS. Here, increasing number of search engine will directly proportional to increasing the target details. The following Figure 5 shows the taxonomy of Single Search key Multiple Search engines. Search engine 1 Search key Search engine 2 Search engine N Figure 5: Single Search key Multiple Search engines Taxonomy 71

High Ended Web Search Engine Taxonomy For example, Figure 6 illustrates conducting the same search key using different search engines simultaneously. Thus, we are passing same Search key to AltaVista (http://www.altavista.com), Yahoo (http://www.search.yahoo.com), and Google (http://www.google.com). As a result, we Search Key will be obtaining more search results on single web search. Figure 6: Example for SSMS MSSS- Multiple Search keys Single Search engine Mostly computer technologists are at presently using the taxonomy of Multiple Search keys Single Search engine (MSSS) for advanced web search. In this taxonomy, we will use more than one Search keys that is assigned to a search engine. MSSS will be displaying lot of details as we search, but these details are from one search engine and will display them. The following Figure 7 shows the taxonomy of Multiple Search keys Single Search engine. Search Key 1 Search Key 2 Search Engine Search Key N Figure 7: Multiple Search keys Single Search engine Taxonomy This Multiple Search keys Single Search engine (MSSS) illustrates an advanced search on the web search engines. For instance, Yahoo search engine contain an advanced web search, which will ask Multiple Search Keys from users to search relevant topics and is shown in Figure 8. 72

Veerasamy Figure 8: Advanced Yahoo Search Engine MSMS- Multiple Search keys Multiple Search engines Multiple Search keys Multiple Search engines (MSMS) taxonomy illustrates an advanced search on the web. Here, we use more than one Search keys that is assigned to more than one search engines. Really, the entire search details will generate from many databases and display them. Some details may come from one search engine and some another details come from another search engines. It is more powerful than Multiple Search key Single Search engine. Here, increasing number of Search key and increasing numbers of search engines will lead to increasing the target details. The following Figure 9 shows the taxonomy Multiple Search keys Multiple Search engines. Search Key 1 Search Engine 1 Search Key 2 Search Engine 2 Search Key N Search Engine N Figure 9: Multiple Search keys Multiple Search engines Taxonomy For example, the following Figure 10 illustrates an advanced search on different search engine at a same time. Here, we pass multiple Search keys on AltaVista, Yahoo, and Google. As a result, we will search out exact details about a particular topic. Hence, we never face any failure in searching. 73

High Ended Web Search Engine Taxonomy Search Key Figure 10: Example for MSMS Implementations Single Search key Single Search engine (SSSS) and Multiple Search keys Single Search engine (MSSS) are usually held on most of search engines. But, we recently have Single Search key Multiple Search engines (SSMS) and Multiple Search keys Multiple Search engines (MSMS). These two taxonomies are implemented in the following algorithms 74

Veerasamy 1. SSMS algorithm 2. MSMS algorithm SSMS algorithm is written for Single Search key Multiple Search engines taxonomy and MSMS algorithm is written for Multiple Search keys Multiple Search engines. SSMS Algorithm Step 1: Get Search key Step 2: Declare number of search engine, in which you are going to search Step 3: Create threads, based on the number of search engine. Thread, which includes retrieving process from each search engine Step 4: Assign Search key to each thread Step 5: Run threads concurrently, while running task Step 6: While fetch data from relevant search engines do usual search on all number of search engines and display on the page. Step 7: Until complete the searching, continue step 5, step 6 Step 8: After finishing search, destroy the thread MSMS Algorithm Step 1: Get Search key 1, Search key 2 Search key N Step 2: Declare search engine, in which you are going to search Step 3: Create threads, based on the number of search engine.. Thread, which includes retrieving process from each search engine Step 4: Based on the number of Search key, assign each Search key to all number of search engine Step 5: Run thread concurrently, while running thread Step 6: While fetch data from relevant search engines do advanced search on all number of search engines and display on the page. Step 7: Until complete the searching, continue step 5, step 6 Step 8: After finishing search, destroy the thread Conclusion Concurrency technique allows a computer program to do more work over the same time period. This kind of computer programs really involves more than one process. It makes more sense of executing tasks concurrently. Web search engines are more supportive to everyone to search topics. The actual idea is, to create search engine program by using concurrent programming with the help of High Ended Web Search Engine Taxonomy. This taxonomy can be real word web search engines, which recently have Single Search key Multiple Search engines (SSMS) and Multiple Search keys Multiple Search engines (MSMS) with SSMS and MSMS algorithms. These algorithms are suggested to create any newly proposed web search engines. SSMS algorithm is somewhat moderated and MSMS algorithm can be an ultimate web search engine, because it has advanced search with multiple search engines. Hence, MSMS algorithm provides a high ended taxonomy for web search engine using concurrent programming. References Veerasamy, B. D. (2010a). Concurrent approach to Flynn s MPSD classification through Java. International Journal of Computer Science and Network Security, 10(1), 126-129. Veerasamy. B. D. (2010b). Concurrent approach to Flynn s MPMD classification through Java. International Journal of Computer Science and Network Security, 10(2), 164-167. 75

High Ended Web Search Engine Taxonomy Biography Bala Dhandayuthapani Veerasamy was born in Tamil Nadu, India in the year 1979. The author was awarded his first masters degree M.S in Information Technology from Bharathidasan University in 2002 and his second masters degree M.Tech in Information Technology from Allahabad Agricultural Institute of Deemed University in 2005. He has published more than fifteen peer reviewed technical papers on various international journals and conferences. He has managed as technical chairperson of an international conference. He has an active participation as a program committee member as well as an editorial review board member in international conferences. He is also a member of an editorial review board in international journals. He has offered courses to Computer Science and Engineering, Information Systems and Technology, since 8 years in the academic field. His academic career started in reputed engineering colleges in India. At present, he is working as a Lecturer in the Department of Computing, College of Engineering, Mekelle University, Ethiopia. His teaching interest focuses on Parallel and Distributed Computing, Object Oriented Programming, Web Technologies and Multimedia Systems. His research interest includes Parallel and Distributed Computing, Multimedia and Wireless Computing. He has prepared teaching material for various courses that he has handled. At present, his textbook on An Introduction to Parallel and Distributed Computing through java is under review and is expected to be published shortly. He has the life membership of ISTE (Indian Society of Technical Education). 76