Search Engine Survey
by Wei Tang
May 17

1 Introduction

The World Wide Web (WWW) has become a huge information source whose content grows and changes every day. Finding something in this huge database is like finding a needle in a haystack. This is where search engines help. Since the emergence of the Internet and the WWW, people have been looking for ways to make searching for information easier. Search engines are considered to be the most successful thing on the Net since its creation.

A little history

The grandfather of all search engines was Archie [2], created in 1990 by Alan Emtage, a student at McGill University in Montreal. In 1990 there was no WWW; the primary method of storing and retrieving files was the File Transfer Protocol (FTP). Initially, anyone who wanted to share a file had to set up an FTP server to make the file available for others to retrieve. But many important files were still scattered across various FTP servers. Word of where files could be found was passed around on the Internet, normally through exchanges and discussion groups. Archie changed all this. Archie's gatherer searched FTP sites across the Internet and indexed all of the files it found. Its regular expression matcher provided users with access to its database.

Then came Veronica, created as a search tool similar to Archie but for Gopher files. Matthew Gray's World Wide Web Wanderer was the first robot on the Web and was designed to track the Web's growth. It soon mended its ways, because early versions of the software ran rampant through the Net and caused noticeable netwide performance degradation. But the robot technique introduced by the Wanderer was widely adopted in later search engine development. By December 1993, three search engines powered by robots had made their debut: JumpStation, the World Wide Web Worm, and the Repository-Based Software Engineering (RBSE) spider.
Because they lacked the intelligence to understand what they were indexing, early spiders really suffered. That is why searchable directories came about. The pioneer was EINet Galaxy, now known as the Tradewave Galaxy. The Galaxy went online in January. It contained Gopher and Telnet search features in addition to its web-searching features. Galaxy is a true directory in the sense that it lists only URLs that have been submitted to it, and all categorization and review of the submitted URLs is done by hand. This results in higher-quality pages and more relevant searches, but far fewer pages to search through.

In April 1994, two Stanford Ph.D. candidates, David Filo and Jerry Yang, created the fantastic Yahoo! [18]. Yahoo! was considered a searchable directory rather than a search engine because the entries were entered and categorized manually. But since Yahoo! has automated some aspects of the gathering and classification process, it has blurred the distinction between search engine and directory.

In early 1994, WebCrawler came into being with a unique feature: it let the user search the full text of entire documents. Previous robots stored only the title, the URL, and the first 100 or so words of a document (see footnote 1). The most important point about WebCrawler is that it was the first full-text search engine on the Internet.

Several competitors emerged within a year of WebCrawler's debut: Lycos, Infoseek, and OpenText. Lycos came out of the labs at Carnegie Mellon University in July. Infoseek was initially just another search engine. It borrowed conceptually from Yahoo! and Lycos, not really innovating in any particular way. Infoseek's user-friendly interface and numerous additional services (such as UPS tracking, news, a directory, and the like) won it praise, but it was Infoseek's strategic deal with Netscape in December 1995 that brought it to the forefront of the search engine camp. Infoseek convinced Netscape to have its engine come up as the default when people hit the Net Search button on the Netscape browser. (Prior to this, Yahoo! was Netscape's default search service; now it is Excite.) Because of Netscape's undoubted dominance of the browser market at the time, Infoseek gained popularity.

AltaVista was a latecomer to the scene. It had its online debut in December. Nonetheless, it had a number of innovative features that quickly catapulted it to the top. One of the most impressive features is its speed.
Running on a cluster of Alpha stations, AltaVista can handle millions of hits per day without slowing down. Today, AltaVista has grown to be one of the largest index engines. Main-memory indexing and clustered servers have made this possible.

On May 20th, 1996, Inktomi Corporation [8] was formed, and HotBot was unleashed on the world (HotBot is powered by Inktomi, so it Lycos now.). The Inktomi engine was quickly licensed to Wired magazine's web site, HotWired. This site's popularity accounted for much of the initial fervor over HotBot. Although the youngest of the major search engines, HotBot is probably the most powerful, with a robot that can supposedly index 10 million pages per day.

Northern Light Technology was founded "by a seasoned management team of librarians, business-to-business, and Internet professionals who recognized the need to fill the gap left by search engines and private research services. Northern Light introduced its research engine in August of 1997" [12]. Northern Light claimed to have the largest search engine database. It now partners with the U.S. Commerce Department's National Technical Information Service (NTIS) for the "one-stop shopping" service gov.search [13].

This survey is organized as follows: in Section 2, I give a brief classification of different search services; Section 3 introduces some basic concepts about search engines and surveys several major search engines, including AltaVista [4], HotBot [6], Infoseek [7], Lycos [10], Excite [5], and Northern Light [12]; Section 4 presents a summary of this report.

Footnote 1: In 1997, Excite bought out WebCrawler, and now AOL is using an Excite derivative as the engine behind its own NetFind.

2 Classification of search services

As discussed in Section 1, there are various flavors of search services on the Web. Generally, they can be classified into three categories: directories, search engines, and meta engines.

Directory (or Catalog)

One of the most commonly misunderstood topics is the difference between directories and search engines. Directories, unlike search engines, are not automated. Site URLs, titles and descriptions must be submitted to them manually for best results. Usually, each directory presents the user with a number of categories under which the listing can be placed. In some cases, the directory will determine which category the site will be placed into, either automatically or through a manual review by an editorial staff. The best example of a directory is Yahoo! [18]. Its listings are placed in easily navigable category folders and sub-folders. When a search is performed on a directory, the results originate directly from the listing information that was organized under a certain topic in the directory. Another example is LookSmart [9].

Search engines

There are relatively fewer search engines than directories on the Web today because of the technical difficulties of building a real search engine. The real search engines are "robot-based" because they employ programs called "robots" or "spiders" that are constantly roaming the Web, indexing everything they find. Search engine databases are huge compared with those of directories. There are two reasons for this:

1. search engines do not review submitted sites to decide whether their content is relevant; everything that meets their programmed requirements is accepted;
2. search engines will index multiple pages from one web site, whereas directories will usually allow only one (sometimes two) listings per domain name.
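The robot behaviour described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the TOY_WEB dictionary, the example URLs, and the crawl function are hypothetical stand-ins for real HTTP fetching and link extraction, and are not taken from any of the engines surveyed here. The sketch shows the two points made in the list: every reachable page is kept without review, and several pages from the same site end up in the collection.

```python
from collections import deque

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
TOY_WEB = {
    "http://a.example/": ("welcome to site a",
                          ["http://a.example/news", "http://b.example/"]),
    "http://a.example/news": ("site a news page", ["http://a.example/"]),
    "http://b.example/": ("site b home",
                          ["http://a.example/news", "http://c.example/missing"]),
}

def crawl(seed, web, max_pages=100):
    """Breadth-first robot: follow links from seed and return {url: text}."""
    seen = {seed}              # URLs already queued, so no page is fetched twice
    queue = deque([seed])
    collected = {}
    while queue and len(collected) < max_pages:
        url = queue.popleft()
        page = web.get(url)
        if page is None:       # dead link: nothing to record
            continue
        text, links = page
        collected[url] = text  # no editorial review: every fetched page is kept
        for link in links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return collected

pages = crawl("http://a.example/", TOY_WEB)
```

A real robot would also need politeness rules (rate limits, honoring site exclusion files), which is the lesson the early Wanderer learned; those are left out here for brevity.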
Currently the most popular robot-based search engines on the Web are AltaVista, HotBot, Excite, Lycos, Infoseek, WebCrawler, Northern Light, Planetsearch, and GoTo.com. In the rest of this report, I refer to these robot-based search engines as "general purpose search engines".

Meta engines

Another category of online search services is "meta engines", which forward search queries to many of the major search engines at once. The interface is usually uniform, hiding the distinct features of each underlying search engine, and the results are normally reformatted by the meta engine. MetaCrawler [11] is an example in this category. Another interesting site is Askjeeves [3], which answers questions posed in natural English, such as "what is the meaning of search engine" and "where do I find Java networking books". Interestingly, Infoseek has a standalone product called "Infoseek Express" that is a meta engine and runs as a plug-in to both the Netscape and Internet Explorer browsers.
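The forward-and-reformat idea behind a meta engine can be illustrated with a small Python sketch. Everything here is an assumption made for illustration: engine_a and engine_b are hypothetical stand-ins for real search engines, and the merge rule (pages reported by more engines first, then best rank) is just one plausible choice, not the method used by MetaCrawler or any other product named above.

```python
def engine_a(query):
    # Hypothetical engine: returns (url, title) pairs in its own rank order.
    return [("http://x.example/", "X home"), ("http://y.example/", "Y page")]

def engine_b(query):
    # A second hypothetical engine with a different result format in mind.
    return [("http://y.example/", "Y"), ("http://z.example/", "Z page")]

def meta_search(query, engines):
    """Forward the query to every engine and merge into one uniform list."""
    merged = {}
    for name, search in engines.items():
        for rank, (url, title) in enumerate(search(query), start=1):
            entry = merged.setdefault(
                url, {"url": url, "title": title, "sources": [], "best_rank": rank}
            )
            entry["sources"].append(name)
            entry["best_rank"] = min(entry["best_rank"], rank)
    # Assumed merge rule: more reporting engines first, then best rank, then URL.
    return sorted(
        merged.values(),
        key=lambda e: (-len(e["sources"]), e["best_rank"], e["url"]),
    )

results = meta_search("java networking books", {"A": engine_a, "B": engine_b})
for r in results:
    print(r["url"], r["sources"])
```

Because each URL appears once in the merged list, the same step also removes duplicates that several engines report for one query.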

3 General purpose search engines

In this section, I survey several major general purpose search engines and compare them based on certain criteria. Some basic concepts and architectures are also discussed.

3.1 How do they work?

Normally, a search engine has three main components:

1. a software robot that roams around the Web;
2. a giant index, which holds everything the robot finds;
3. the search engine software, the program that sifts through the millions of pages recorded in the index to find matches to a search and rank them in order of what it believes is most relevant.

Figure 1 shows a general picture of how search engines work.

Figure 1: How search engines work

3.2 How do they differ?

Not all search engines are created equal. There are some important criteria that we can use to distinguish them. Here, I present the differences among six major search engines: AltaVista, HotBot, Infoseek, Lycos, Excite, and Northern Light.

Size

Size matters. Although it is very unlikely that any of the existing search engines can cover the whole Web, all of them are trying to index as many web pages as possible. Figure 2 shows a recent report on the index sizes of major search engines. Figure 3 shows the trend of size increases for each of them over time.

Figure 2: Search engine sizes (as of May 1, 1999)

Figure 3: Search engine sizes over time

Index scope

All six search engines provide search facilities on the Web, Newsgroups, and Addresses (Business, Residential, ). Only Northern Light provides FTP search.

Indexing method

Indexing can be based on full text, keywords, or even natural language. All six major search engines use full-text indices. Only Lycos and Excite support a natural language indexing method.

Search Logic

Search Engine: Search Logic
AltaVista: Default is "OR"; advanced searching supports full boolean terms: OR, AND, NOT, NEAR; using upper case letters forces an exact match, lower case will search both upper and lower case; uses "*" to truncate
Hotbot: Default is "AND"; boolean searching supports AND, OR, and NOT; advanced searching supports "must", "should" and "must not" semantics; uses "*" to truncate
Infoseek: Default is "OR", allows variations of AND and NOT; searches are case sensitive
Lycos: Default is "OR"; advanced searching supports AND, OR, NOT and rich relationship operators: ADJ, NEAR, FAR and BEFORE
Excite: Default is "AND", supports OR and AND NOT; recognizes upper case letters on word beginnings as names; terms are automatically searched as word prefixes or roots
Northern Light: Default is "AND", allows OR and NOT; supports both "%" and "*" as truncation symbols; boolean terms OR and NOT may be inserted between words or phrases in quotes

In addition, all these search engines support "+" to include words and "-" to exclude words, as well as double quotes to enclose a phrase.
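To make the index and the search logic concrete, here is a minimal Python sketch of a full-text inverted index with a configurable default operator and the "+" and "-" prefixes. It is an illustration only: the DOCS collection and the build_index and search functions are hypothetical, phrase search with double quotes, truncation, and case rules are not implemented, and none of this reflects the actual internals of the engines in the table.

```python
from collections import defaultdict

# Hypothetical document collection standing in for pages found by the robot.
DOCS = {
    1: "search engines index the web",
    2: "a directory lists web sites by hand",
    3: "robots roam the web for search engines",
}

def build_index(docs):
    """Full-text index: map each lower-cased word to the ids of docs containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index

def search(index, query, default="AND"):
    """Evaluate a keyword query with optional +word (include) / -word (exclude)."""
    required, excluded, plain = [], [], []
    for term in query.lower().split():
        if term.startswith("+") and len(term) > 1:
            required.append(term[1:])
        elif term.startswith("-") and len(term) > 1:
            excluded.append(term[1:])
        else:
            plain.append(term)
    if default == "AND":       # under AND, every plain term is required
        required += plain
        plain = []
    if not required and not plain:
        return []              # a query needs at least one positive term
    hits = set().union(*index.values())
    for word in required:
        hits &= index.get(word, set())
    if plain:                  # under OR, at least one plain term must match
        hits &= set().union(*(index.get(w, set()) for w in plain))
    for word in excluded:
        hits -= index.get(word, set())
    return sorted(hits)

index = build_index(DOCS)
print(search(index, "search engines"))                  # [1, 3]
print(search(index, "directory robots", default="OR"))  # [2, 3]
print(search(index, "web -robots"))                     # [1, 2]
```

The same query can return different result sets depending on the default operator, which is why the defaults listed in the table matter to the user.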

Power features

Search Engine: Power features
AltaVista: supports searching in 25 languages (including Asian languages) and translation between 5 Romanized languages; refines on date
Hotbot: supports searching in 9 Romanized languages; refines on date, file extension, location/domain, page depth and word stemming (grammatical variations); searches within results
Infoseek: refines on domain and location; searches within results
Lycos: supports searching in 15 Romanized languages; searches within results; features powerful MP3 searching
Excite: supports searching in 9 languages (including Asian languages); refines on domain and location; has adult content filter; searches similar pages
Northern Light: supports searching in 5 Romanized languages; refines on date, web source type, subject and location; categorizes results as "custom search folders" to further refine results, very impressive

Ranking algorithm

Search engines use several schemes to determine a page's relevance ranking:

1. Location of keywords. Words occurring near the top of the page are usually given a greater relevance score by the search engines. Also, some search engines give priority to pages on which the keywords (the words you're searching for) appear in the title of the page, or in the URL (the web address).
2. Frequency of keywords. The more times a word appears on a page, the higher that page moves in the ranking returned by the search engine. But don't abuse it, because most search engines have anti-spam techniques to detect whether a page is "spamming". If so, they may refuse to index the page, or lower its relevance ranking.
3. Popularity. Some search engines also give extra points to pages that have other pages linking to them. Excite, Infoseek, Lycos, Google, and WebCrawler consider the "popularity" of a site.
4. Meta tags. Meta tags are description and content keywords created by the site designer and embedded in the source code. These meta tags are considered by Hotbot and Infoseek, but ignored by Excite and AltaVista.
5. Paid listings. GoTo.com was the first to use paid listings. AltaVista then followed its lead and introduced paid listings in April 1999, hoping that this could improve relevancy better than any algorithm.

3.3 What is the problem?

Although search engines are great and have made searching the Net a lot easier than before, there are still some unsolved problems.

There are too many of them

Nobody really knows how many search engines are out there on the Internet. I have named only a few major ones in this report. There are perhaps hundreds of specialized search engines and meta engines available on the Net. No one will try to learn how to use them all. Most likely, a user sticks to one particular search engine (probably the first one ever used) and does not use others at all unless it is really necessary.

There is no standard

Every search engine has its own robot and indexing algorithm. There have been efforts to bring all the major players together and settle on an interchangeable search engine standard [16], but it was adopted by only a few vendors. If there were a standard, document accuracy, vendor independence and engine functionality would have improved a lot more.

Result duplication

Every search engine suffers from duplicates in its search results. This is partly because of the nature of web pages (documents link to each other), and partly because eliminating duplicates from the database is tedious work. As the main goal of search engines is speed, duplicates are overlooked.

Query expressiveness

All search engines use keyword-based search, because it is easy both for the users and for the software vendors. Some search engines use enhanced boolean expressions, but these are still based on keywords. Because of this limitation, we cannot expect the same accuracy as we get from SQL query results in a database.

Multimedia search

Most of the existing search engines search text documents. There are no general-purpose content-based multimedia search engines on the Internet yet, although there are some prototype systems [17].
Multimedia applications, especially video-on-demand applications, are already emerging (such as CNN's videoselect). There will be a need to search audio, image, and video content on the Net.

4 Summary

As discussed in previous sections, search engines are widely used and have brought us much convenience. Along with the growth of the Internet, I also expect the search engine market to grow. While the general search engines continue to enhance their performance, more specialized search engines will emerge on the Internet and on enterprise intranets. With speed and size still the main challenges for search engines, relevance is becoming more and more important. AI techniques should be used in the development of search engines to provide more intelligent services. Some new trends include changing the "pull" nature of existing search engines to be "push-enabled". If SQL-like search languages are to be used, distributed query optimization, routing, and scheduling will also become big challenges.

References

[1] A history of search engines.
[2] Archie.
[3] Askjeeves.
[4] AltaVista.
[5] Excite.
[6] Hotbot.
[7] Infoseek.
[8] Inktomi.
[9] LookSmart.
[10] Lycos.
[11] Metacrawler.
[12] Northern Light.
[13] One-stop shopping.
[14] Search Engine Showdown.
[15] Search Engine Watch.
[16] L. Gravano, C.K. Chang, H. Garcia-Molina, C. Lagoze, A. Paepcke. STARTS: Stanford Protocol Proposal for Internet Meta-Searching. In Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data, May 1997.
[17] WebSEEk.
[18] Yahoo!.
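Supplementary sketch: ranking signals

As a supplement to the ranking schemes listed in Section 3.2, the following Python sketch combines three of them (keyword location, keyword frequency, and popularity) into one score. It is a toy under stated assumptions: the weights, the cap of 5 on frequency, the 20-word "top of page" window, and the PAGES and INBOUND data are all hypothetical choices for illustration, and no engine discussed in this survey is known to use this formula.

```python
# Hypothetical pages and inbound-link counts (popularity).
PAGES = {
    "http://a.example/": {"title": "Search engine survey",
                          "body": "a survey of search engine history"},
    "http://b.example/": {"title": "Cooking",
                          "body": "search search search search search search search recipes"},
    "http://c.example/": {"title": "Links",
                          "body": "useful pages about engine design"},
}
INBOUND = {"http://a.example/": 2, "http://b.example/": 0, "http://c.example/": 4}

def score(page, query_words, inbound_links):
    """Combine frequency, location, and popularity with arbitrary weights."""
    words = page["body"].lower().split()
    title = page["title"].lower().split()
    total = 0.0
    for q in query_words:
        q = q.lower()
        total += min(words.count(q), 5)  # frequency, capped as a crude anti-spam rule
        if q in title:
            total += 3.0                 # location: keyword appears in the title
        if q in words[:20]:
            total += 1.0                 # location: keyword near the top of the page
    total += 0.5 * inbound_links         # popularity: pages linking to this page
    return total

def rank(pages, query_words, inbound):
    """Return URLs ordered from highest to lowest score."""
    return sorted(
        pages,
        key=lambda url: score(pages[url], query_words, inbound.get(url, 0)),
        reverse=True,
    )

print(rank(PAGES, ["search", "engine"], INBOUND))
```

With these weights, the page that repeats "search" seven times does not outrank the page that mentions both keywords in its title, which illustrates why keyword stuffing alone need not pay off.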

Directory Search Engines Searching the Yahoo Directory

Directory Search Engines Searching the Yahoo Directory Searching on the WWW Directory Oriented Search Engines Often looking for some specific information WWW has a growing collection of Search Engines to aid in locating information The Search Engines return

More information

Google Inc. The world s leading Internet search engine. MarketLine Case Study. Reference Code: ML Publication Date: March 2012

Google Inc. The world s leading Internet search engine. MarketLine Case Study. Reference Code: ML Publication Date: March 2012 MarketLine Case Study Google Inc. The world s leading Internet search engine Reference Code: ML00001-091 Publication Date: March 2012 WWW.MARKETLINE.COM MARKETLINE. THIS PROFILE IS A LICENSED PRODUCT AND

More information

Search Engines. Information Technology and Social Life March 2, Ask difference between a search engine and a directory

Search Engines. Information Technology and Social Life March 2, Ask difference between a search engine and a directory Search Engines Information Technology and Social Life March 2, 2005 Ask difference between a search engine and a directory 1 Search Engine History A search engine is a program designed to help find files

More information

AN OVERVIEW OF SEARCHING AND DISCOVERING WEB BASED INFORMATION RESOURCES

AN OVERVIEW OF SEARCHING AND DISCOVERING WEB BASED INFORMATION RESOURCES Journal of Defense Resources Management No. 1 (1) / 2010 AN OVERVIEW OF SEARCHING AND DISCOVERING Cezar VASILESCU Regional Department of Defense Resources Management Studies Abstract: The Internet becomes

More information

Almost 80 percent of new site visits begin at search engines. A couple of years back Nielsen published a list of popular search engines.

Almost 80 percent of new site visits begin at search engines. A couple of years back Nielsen published a list of popular search engines. SEO OverView We have a problem, we want people to visit our Web site, that's the purpose after all to bring people to our website and increase traffic inorder to buy soundspirit products and learn more

More information

Today we shall be starting discussion on search engines and web crawler.

Today we shall be starting discussion on search engines and web crawler. Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #38 Search Engines and Web Crawler :: Part 1 Today we shall

More information

Introduction. What do you know about web in general and web-searching in specific?

Introduction. What do you know about web in general and web-searching in specific? WEB SEARCHING Introduction What do you know about web in general and web-searching in specific? Web World Wide Web (or WWW, It is called a web because the interconnections between documents resemble a

More information

Structure Objectives Introduction Search Engines: Definitions Search Engines: Evolution How Do Search Engines Work?

Structure Objectives Introduction Search Engines: Definitions Search Engines: Evolution How Do Search Engines Work? UNIT 13 SEARCH ENGINES Search Engines Structure 13.0 Objectives 13.1 Introduction 13.2 Search Engines: Definitions 13.3 Search Engines: Evolution 13.4 How Do Search Engines Work? 13.4.1 The Robot or Spider

More information

Module 1: Internet Basics for Web Development (II)

Module 1: Internet Basics for Web Development (II) INTERNET & WEB APPLICATION DEVELOPMENT SWE 444 Fall Semester 2008-2009 (081) Module 1: Internet Basics for Web Development (II) Dr. El-Sayed El-Alfy Computer Science Department King Fahd University of

More information

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost

More information

Web Search Strategy/Behavior. Meenu Sharma. Librarian Canadian Institute for International studies C-2, Phase-1, Industrial Area, Mohali

Web Search Strategy/Behavior. Meenu Sharma. Librarian Canadian Institute for International studies C-2, Phase-1, Industrial Area, Mohali By Meenu Sharma Librarian Canadian Institute for International studies C-2, Phase-1, Industrial Area, Mohali E-mail: meenusharma982@yahoo.com ABSTRACT A library is a place where the right information is

More information

Skill Area 209: Use Internet Technology. Software Application (SWA)

Skill Area 209: Use Internet Technology. Software Application (SWA) Skill Area 209: Use Internet Technology Software Application (SWA) Skill Area 209.1 Use Browser for Research (10hrs) 209.1.1 Familiarise with the Environment of Selected Browser Internet Technology The

More information

Computer Fundamentals : Pradeep K. Sinha& Priti Sinha

Computer Fundamentals : Pradeep K. Sinha& Priti Sinha Computer Fundamentals Pradeep K. Sinha Priti Sinha Chapter 18 The Internet Slide 1/23 Learning Objectives In this chapter you will learn about: Definition and history of the Internet Its basic services

More information

5 Choosing keywords Initially choosing keywords Frequent and rare keywords Evaluating the competition rates of search

5 Choosing keywords Initially choosing keywords Frequent and rare keywords Evaluating the competition rates of search Seo tutorial Seo tutorial Introduction to seo... 4 1. General seo information... 5 1.1 History of search engines... 5 1.2 Common search engine principles... 6 2. Internal ranking factors... 8 2.1 Web page

More information

What Is Voice SEO and Why Should My Site Be Optimized For Voice Search?

What Is Voice SEO and Why Should My Site Be Optimized For Voice Search? What Is Voice SEO and Why Should My Site Be Optimized For Voice Search? Voice search is a speech recognition technology that allows users to search by saying terms aloud rather than typing them into a

More information

How to Get Your Website Listed on Major Search Engines

How to Get Your Website Listed on Major Search Engines Contents Introduction 1 Submitting via Global Forms 1 Preparing to Submit 2 Submitting to the Top 3 Search Engines 3 Paid Listings 4 Understanding META Tags 5 Adding META Tags to Your Web Site 5 Introduction

More information

Don't Become Roadkill on the Information Superhighway: Dealing with Information Overload

Don't Become Roadkill on the Information Superhighway: Dealing with Information Overload University of Kentucky UKnowledge Library Presentations University of Kentucky Libraries 11-1996 Don't Become Roadkill on the Information Superhighway: Dealing with Information Overload Antoinette Paris

More information

Understanding SEO IN THIS PART

Understanding SEO IN THIS PART 75002c01.qxd:Layout 1 11/7/07 9:30 AM Page 1 MA TE RI AL Understanding SEO S PY R IG HT ED earch engine optimization (SEO) is such a broad term. It can be quite overwhelming if you try to take the whole

More information

The Ultimate Digital Marketing Glossary (A-Z) what does it all mean? A-Z of Digital Marketing Translation

The Ultimate Digital Marketing Glossary (A-Z) what does it all mean? A-Z of Digital Marketing Translation The Ultimate Digital Marketing Glossary (A-Z) what does it all mean? In our experience, we find we can get over-excited when talking to clients or family or friends and sometimes we forget that not everyone

More information

A web directory lists web sites by category and subcategory. Web directory entries are usually found and categorized by humans.

A web directory lists web sites by category and subcategory. Web directory entries are usually found and categorized by humans. 1 After WWW protocol was introduced in Internet in the early 1990s and the number of web servers started to grow, the first technology that appeared to be able to locate them were Internet listings, also

More information

Search Engine Technology. Mansooreh Jalalyazdi

Search Engine Technology. Mansooreh Jalalyazdi Search Engine Technology Mansooreh Jalalyazdi 1 2 Search Engines. Search engines are programs viewers use to find information they seek by typing in keywords. A list is provided by the Search engine or

More information

Pay-Per-Click Advertising Special Report

Pay-Per-Click Advertising Special Report Pay-Per-Click Advertising Special Report Excerpted from 2005 by Kenneth A. McArthur All Rights Reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted by any means,

More information

SEARCH ENGINE INSIDE OUT

SEARCH ENGINE INSIDE OUT SEARCH ENGINE INSIDE OUT From Technical Views r86526020 r88526016 r88526028 b85506013 b85506010 April 11,2000 Outline Why Search Engine so important Search Engine Architecture Crawling Subsystem Indexing

More information

Web Search Basics Introduction to Information Retrieval INF 141/ CS 121 Donald J. Patterson

Web Search Basics Introduction to Information Retrieval INF 141/ CS 121 Donald J. Patterson Web Search Basics Introduction to Information Retrieval INF 141/ CS 121 Donald J. Patterson Content adapted from Hinrich Schütze http://www.informationretrieval.org Overview Overview Introduction Classic

More information

Web Search. Web Spidering. Introduction

Web Search. Web Spidering. Introduction Web Search. Web Spidering Introduction 1 Outline Information Retrieval applied on the Web The Web the largest collection of documents available today Still, a collection Should be able to apply traditional

More information

Administrivia. Crawlers: Nutch. Course Overview. Issues. Crawling Issues. Groups Formed Architecture Documents under Review Group Meetings CSE 454

Administrivia. Crawlers: Nutch. Course Overview. Issues. Crawling Issues. Groups Formed Architecture Documents under Review Group Meetings CSE 454 Administrivia Crawlers: Nutch Groups Formed Architecture Documents under Review Group Meetings CSE 454 4/14/2005 12:54 PM 1 4/14/2005 12:54 PM 2 Info Extraction Course Overview Ecommerce Standard Web Search

More information

Your Website as a Marketing Tool. Randy L. Martin R. L. Martin and Associates

Your Website as a Marketing Tool. Randy L. Martin R. L. Martin and Associates Your Website as a Marketing Tool Randy L. Martin R. L. Martin and Associates Getting Started Register Your Domain Name Pick something that people can associate with your company Pick something easy to

More information

Provided by TryEngineering.org -

Provided by TryEngineering.org - Provided by TryEngineering.org - Lesson Focus Lesson focuses on exploring how the development of search engines has revolutionized Internet. Students work in teams to understand the technology behind search

More information

AN SEO GUIDE FOR SALONS

AN SEO GUIDE FOR SALONS AN SEO GUIDE FOR SALONS AN SEO GUIDE FOR SALONS Set Up Time 2/5 The basics of SEO are quick and easy to implement. Management Time 3/5 You ll need a continued commitment to make SEO work for you. WHAT

More information

Locating Resources for Your Unit Plan. Discuss copyright and fair use guidelines

Locating Resources for Your Unit Plan. Discuss copyright and fair use guidelines Locating Resources for Your Unit Plan Objectives T E A C H E R S W I L L Discuss copyright and fair use guidelines Use search engines and directories to locate information on the Internet for their unit

More information

extreme searching: how to avoid extreme frustration and bird walks presented by Kathy Schrock Overview The Problems

extreme searching: how to avoid extreme frustration and bird walks presented by Kathy Schrock Overview The Problems extreme searching: how to avoid extreme frustration and bird walks presented by Kathy Schrock kathy@kathyschrock.net Overview Problems with searching Three main types of search tools The top search engines

More information
