How are XML-based Marc21 and Dublin Core Records Indexed and ranked by General Search Engines in Dynamic Online Environments?
|
|
- Sandra Wheeler
- 6 years ago
- Views:
Transcription
1 How are XML-based Marc21 and Dublin Core Records Indexed and ranked by General Search Engines in Dynamic Online Environments? A. Hossein Farajpahlou Professor, Dept. Lib. and Info. Sci., Shahid Chamran University, Ahvaz, Iran. Faeze Tabatabai MLS graduate, Dept. Lib. and Info. Sci., Shahid Chamran University, Ahvaz, Iran. Abstract Purpose - The Purpose of this research was to examine the indexing quality and ranking of XML content objects containing Dublin Core and MARC 21 metadata elements in dynamic online information environments by general search engines like Google and Yahoo!. Design/methodology/approach- 100 XML content objects were analyzed in two groups: those with DCXML elements and those with MARCXML elements, which were published on the website from late July 2009 till June Data was collected during April 2010 by means of a checklist. The website was introduced to Google & Yahoo! search engines. Google search engine was able to retrieve fully all the content objects during the study period through their Dublin Core and MARC 21 metadata elements; Yahoo! search engine, however, didn't respond at all. The indexing quality of metadata elements embedded in content objects as in a dynamic online information environment, and their indexing and ranking capabilities were compared and examined. Findings- Results of the study showed that all Dublin Core and MARC 21 metadata elements were indexed by Google search engine, and that there was no observed difference between indexing quality and ranking with DCXML and MARCXML metadata elements in dynamic online information environments as performed by Google. All in all, results of the study revealed that neither the XML-based Dublin Core Metadata Initiative, nor MARC 21 demonstrate any preference in regard with accession in dynamic online information environments through Google search engine. Practical Implications results of the present study would provide useful information as well as a basis for search engine designers who are involved with creation of indexing software. Originality/Value the present study was conducted for the first time in dynamic environments using XML-based metadata elements. Therefore, it can provide ground for further studies of the kind. Keywords- Dublin Core, MARC 21, Indexing, ranking, dynamic online environments, Google, XML. Introduction In line with recent developments in information and communication technology, especially in regards with the World Wide Web on the one hand, and the global developments and increase in the production of scientific information on the other, we are
2 witnessing increasing growth and improvement in different dynamic online information databases. They contain content objects and up-to-date scientific sources in different branches of human knowledge. Therefore, the significance of knowledge and information classification, as a basic issue in librarianship, is always considered by experts in this field. As a result, a vast amount of extraordinary activities and researches have been conducted on development of metadata initiatives and standards which are based on nowadays needs of various domains. In other words, the need for application of metadata initiative and standards is now unavoidably associated with ongoing developments in digital libraries and dynamic online information databases. One of the most important metadata schemes which has made itself compatible to dynamic environment to be applied in identification, classification, and retrieval of Web resources and content objects is the MARC metadata format. Dublin Core metadata initiative (DCMI) is another main and international metadata initiative which was originally created for application in identification, retrieval and classification of the Web content objects. An important point to make is that observing the metadata initiative for identification, retrieval and classification of resources and content objects and facilitation of their exchange process on the Web is only one side of the coin in the fast and relevant retrieval of the content objects. The other side is inevitable deeper attention to the most important and applicable internet search tools; since the majority of internet users use general search engines for searching and retrieving their needed sources, especially the content objects available in dynamic online information environments. Not only this, the interoperability of these tools and metadata schemes is also another major issue to be considered. Another major issue in identification and effective retrieval of content objects consisting of metadata elements relates to their semantic environment and platform. Dublin Core metadata initiative and MARC 21 tendency to XML (Extensible Markup Language) advanced technology is due to this technology s high capacity in increasing the interoperability of the most important and frequently used internet search tools with mentioned metadata initiatives. Implementation of MARC 21 and Dublin Core metadata elements on the XML platform has provided added values to both metadata initiatives. One advantage is that the indexing software of search engines is able to index these metadata initiative elements in XML thoroughly. Findings of Taheri (2008) research (in static information environment) emphasize this issue. However, the fast tracking, retrieval and storing of information in dynamic online environments remains an issue that should be addressed. Experts express metadata usage as a solution. On the other hand, as mentioned earlier, the majority of web users use common general search engines for searching and retrieval of content objects in dynamic online information environments. Therefore, suppliers are usually interested and keen about interoperability between these tools and metadata schemes so as to be able to provide tools more efficient in searching and retrieval of available content objects in such environments. Hence, the present study mainly aimed at examining the indexing quality and ranking of content objects consisting of the XML based MARC 21 and Dublin Core metadata elements in dynamic online information environments by general search engines. It also seeks to provide answers and solutions to how s about questions regarding the relation and interaction between indexing software of general search engines such as Yahoo and Google with XML-based content objects in dynamic online. 2
3 Another aim was to find out which one of the mentioned metadata initiatives is more efficient in indexing and ranking XML based content objects in dynamic online information environment by Yahoo and Google as search engines. As most of the information and scientific content objects are in dynamic online information environment, the importance of this research lies in unveiling the possibility of connection and interaction of the two indexing software of Yahoo and Google as public search engines with XML based content objects in dynamic online information environment and MARC 21 and Dublin Core metadata elements. Therefore, having identified the search engine with more efficiency in indexing and ranking of XML based content objects in dynamic online information environment, designers could use the relevant metadata elements in their schemes for indexing and ranking of search result. This would also make possible for designers of content objects in dynamic online information environments appropriate application and usage of metadata initiatives. Research Objectives To achieve the main goal of the present research, following stages were considered to follow: examining the difference in quality of indexing capabilities of Google and Yahoo search engines regarding content objects which contain XML based MARC 21 and Dublin Core Metadata elements in dynamic online information environment; examining the difference in ranking capabilities of Google and Yahoo search engines in regards with content objects consisting of XML-based MARC 21 and Dublin Core metadata elements in dynamic online information environment; observing and examining the reaction of indexing software (robots) of Yahoo and Google search engines to XML- based content objects in dynamic online information environment with both, flat (or tree) and hierarchical (or family) structures; observing and examining the reaction of indexing software (robots) of Yahoo and Google search engines to metadata initiatives, both with language based tags (Dublin Core) and without language based tags (MARC 21); Regarding Yahoo and Google search engines, selecting the metadata initiative which is likely more appropriate for organization of content objects in dynamic online information environment. The Research questions This research was about to find answers to the following seven questions: 1. How is the indexing quality of content objects containing XML-based Dublin Core metadata elements in dynamic online information environments as performed by Yahoo and Google search engines? 2. How is the indexing quality of content objects containing XML-based MARC 21 metadata elements in dynamic online information environments as performed by Yahoo and Google search engines? 3
4 3. What is the difference between the indexing quality of three main elements (title, author and subject) of content objects containing Dublin Core and XML based MARC 21 metadata elements in dynamic online information environments as performed by Yahoo and Google search engines? 4. What is the difference ranking procedure of content objects containing XML based MARC 21 and Dublin Core metadata elements in dynamic online information environments as performed by Yahoo and Google search engines? 5. How is the reaction of Yahoo and Google search engines to content objects of dynamic online information environments containing XML- based metadata elements with flat structure (Dublin Core) and hierarchical structure (MARC 21)? 6. How is the reaction of Yahoo and Google search engines to metadata initiatives with language- based tags (Dublin Core) and without language- based tags (MARC 21)? 7. Which one of MARC 21 and Dublin Core metadata initiatives is more suitable for classification of XML based content objects in dynamic online information environments in regard with accession through Google and Yahoo search engines? Research background Investigation of the current literature reveals that more research has been done elsewhere than in Iran on the present subject, of which some examples will be described as follows. Turner and Brackbill (1998) looked at the ways in which accessing HTML documents could be improved. As a result of their experimental research on Hypertext Markup Language (HTML) meta-tags in regards with web document retrieval by search engines, they found out that assigning the description feature alone was not able to improve accession, however, the keywords feature did improve accession. Sokvitne (2000) conducted a research on the websites of 20 Australian large educational and government organizations aiming at identifying the ability to retrieve key elements such as title, publisher, author, and subject in Dublin Core metadata initiative. Results of the study revealed that because of inconsistency in the content records formats, elements such as author, publisher and co-author which could be useful in searching and retrieval of objects, remained useless. Since the subject was not used properly and the title content was the same as the HTML title s tag content, these elements are not effective in the retrieval process. Henshaw and Valauskas (2001) conducted an experimental research on some selected pages of First Monday s electronic magazine. Two groups of pages were included in this research. One group functioned as the control group with no metadata element; and the other group had had Dublin Core metadata elements as well as HTML keywords and description meta-tags. Results of the study revealed that metadata alone did not have any impact on increasing the probability of the sources indexing and getting top ranking in search engines results. Zhang and Dimitroff (2004) in an article entitled Internet search engine's response to metadata Dublin Core implementation which was published on the basis of an experimental research, examined the function of seven main search engines which were categorized in 4
5 two groups: a target group and a control group. The target group consisted of subject element of the Dublin Core metadata scheme as well as keyword element from the HTML language. The control group lacked any such elements. The results showed that there was significant difference between two groups in terms of visibility for search engines; i.e., six out of the 7 search engines responded positively to metadata elements. Quevedo-Torrero (2004) devoted his doctoral thesis to improving retrieval from the web by hypertext markup language tags, using an experimental research method. The main purpose of this research was improving search quality and retrieval of pages in the web by inserting keyword in hypertext markup language meta-tags as metadata. In this research which was conducted on a selection of search results in search engines like Google and Altavista, some strategies were formulated and suggested for improvement in ranking of search results on the bases of using hypertext markup language meta-tags as metadata, and clustering web pages according to their link structures. Zhang and Dimitroff (2005a) examined the effect of web page content features on their visibility and inclusion in search engines result. This research aimed at finding answers to the question: how could ranking of a page or a site in search engine result be improved in view of authors or developers of pages or websites? The study results revealed that repetition of keywords in the title as well as in the full text body improves the visibility of pages in search engines results. Factors like color and font size proved having no effect on the visibility improvement. Zhang and Dimitroff (2005 b) conducted another experimental study to examine the effect of implementing metadata on the WWW pages appearance in search engines results. For this purpose they introduced 40 test web pages to 19 search engines. The results of the study showed that the metadata is an appropriate and effective mechanism in regard with page appearance and ranking in has effect on appearance and ranking improvement of web pages in the search engines result. Moreover, keywords extracted from web pages, especially from title and full-text body, proved to be very effective in ranking. Mohamed (2006) investigated the effect of metadata usage on the web pages ranking and retrieval. This research was conducted in two parts. In part one, the effect of metadata initiative on the accession of content objects was considered and examined. In part two, by adding metadata elements to web pages, the extent of their indexing was measured as well as the effect of metadata on page ranking. Results of this research showed that description elements and keywords have significant role in page ranking. Also, a couple of relevant studies have been conducted in Iran. Safari (2005) in a research on 16 articles that were published on the web version of the Iranian International Journal of Science examined the effect of Dublin Core metadata elements (4 elements out of 15 elements of this initiative) on the web source rank detecting and improvement of web sources ranks as conducted by three search engines of Google, AltaVista and Lycos. The results of this experimental study showed no significant differences between ranking of pages that contained Dublin Core metadata elements and those of the control group that lacked such elements. Also, no significant impact was seen in the retrieval of pages. Taheri (2008) conducted a comparative study on the indexing quality and ranking of content objects containing XML based Dublin Core and MARC 21 metadata elements by general search engines as his Master s thesis research project. Taheri's research shows 5
6 that there is no significant difference between the indexing quality of content objects containing XML based Dublin core and MARC 21 metadata elements as performed by Google and Yahoo search engines. Also, there was no observed significant difference between content objects ranking containing of the two metadata initiatives in Google search engine; however, there was significant difference in ranking status of content objects containing the two metadata initiatives in Yahoo search engine. Finally, none of the two mentioned XML- based metadata initiatives has preference over the other in regard with accessing through general search engines. The research method The present research experiments the interoperability of content objects containing XML based Dublin Core and MARC 21 metadata elements in dynamic online information environments with general search engines. 100 content objects, i.e. ebooks, were selected out of California digital library source set. These ebooks were selected using the url and focusing on the subject "theory of knowledge". The mentioned content objects were divided into two groups. The first group contained Dublin Core metadata elements (XML- based), and the second group contained XML based MARC 21 metadata elements. Both groups were mounted on and introduced to Yahoo and Google search engines from late July 2009 till June The data were collected in April The mentioned website was introduced to Google search engine by "Webmaster Tools" through "XML Sitemap" option and "Suggest a site". Introduction to Yahoo! was done using "Yahoo! Search URL Status Review Form" and "ROR & Text Sitemap" with the same condition. Google search engine could retrieve all the content objects fully by Dublin Core and MARC 21 Metadata elements, however, Yahoo search engine, despite many follow-ups, did not respond at all until, not only the deadline, but we believe until now! Therefore, the researchers had to rely only on the Google results. The data was collected by means of a checklist which was devised on the basis of, and according to research questions and requirements. Data gathering was conducted by the query: "keyphrase"site:marcdcmi.ir. The data that was collected by means of the checklist, were transferred to worksheets in which + and - signs were assigned as indications of being indexed or not being indexed, respectively. Each of these positions received 1 and 0 values respectively for calculation purposes. The sum of these values were then used in analyses and answering the research questions. The data thus provided, was then keyed in the SPSS Software and was analyzed according to research questions. Research findings As mentioned above, Yahoo search engine never responded to metadata elements retrieval till the deadline. Therefore, following will be an account and description of the results obtained by Google search engine. Table 1 is presented in line with answering the 1 st and 2 nd research questions. The contents of table 1 indicate that Google search engine has been able to index Dublin Core metadata initiative elements (9 elements) as well as MARC 21 elements (10 elements). 6
7 Therefore, XML based content objects which were embedded in the research dynamic online information environment, proved to be retrievable. In fact, the indexing quality of the selected elements by Google search engine is suitable. Table 1: the indexing quality of Dublin Core and MARC 21 metadata initiative elements in XML based content objects in dynamic online information environments by Google search engine The number The number Metadata Indexing of the of content website initiative (in percentage studied objects Google) elements %100 % Dublin Core MARC 21 Table 2 illustrates the indexing quality of Google search engine in regards with title, author and subject elements both in Dublin Core and Marc21 metadata schemes. The content of table 2 answers the 3 rd research question. With reviewing of the second table data for answering to the third question what is the difference between the indexing quality of three main elements (title, author and subject) of content objects containing Dublin Core and XML based MARC 21 metadata elements in dynamic online information environments as performed by Yahoo and Google search engines? Obviously, the data reveals that, Google search engine is able to index title, author, and subject content elements in Dublin Core and MARC 21 metadata initiatives. Therefore, there is no difference between these elements in this regard. Table 2: the indexing quality of title, author and subject elements related to Dublin Core and MARC 21 embedded in content objects XML based in dynamic information environments by Google search engine. The obtained point by content objects The number of content objects The studied main elements title website Metadata initiative (in Google) Dublin Core title MARC 21 author Dublin Core author MARC 21 subject Dublin Core subject MARC 21 Table 3 is used to answer the fourth research question regarding the rank quality. As the contents of table 3 indicate, Google search engine follows the same policy for ranking of XML- based content objects in dynamic online information. That is, this search engine 7
8 out of content objects containing MARC 21 metadata elements, places only 25 objects higher than content objects containing Dublin Core elem. In other words, the ratio of XML-based content objects containing metadata elements is equally 25 out of for both Dublin Core and Marc21. Table 3: The ranking output of XML based content objects containing Dublin Core and MARC21 in dynamic online information environments by Google search engine. The point of content objects placed higher total number of content objects website Metadata initiative (in Google) Dublin Core MARC 21 In answering the fifth and sixth questions, the contents of table 2 would be useful. The data in table 2 show that Google search engine indexing software does not discriminate between content objects with flat structure and with language based tag and those with hierarchical structure and without language based tag. Finally, the seventh research question is trying to determine the more suitable metadata initiative for organization of the XML- based content objects in dynamic online information environments in terms of accessibility by Google search engine. In answering this question, one would conclude that none of the XML- based Dublin Core metadata initiative, or MARC 21 shows any preference over the other in this regard. In other words, both metadata schemes are appropriate for organization of XML-based content objects in dynamic online information environments, as far as accessibility by Google is concerned. Discussion and Conclusion According to the above data and discussions, it can be concluded that XML, as the syntax ground for implementing the metadata elements of Dublin Core and Marc21, in analogy with HTML, can be effective both in Static and Dynamic environments; therefore, it seems to be more appropriate. Because it maximizes the interoperability between search engines and the metadata initiatives which aim at identification, description, locating and retrieving of content objects in static and dynamic online environments. Therefore, as is clear from answers to questions 1 and 2, both metadata initiatives, ie, Dublin Core and Marc21, could be regarded as appropriate for making different content objects accessible in dynamic online environments via Google search engine. On the other hand, none of the two metadata initiatives proved having clear preference and superiority over the other in regards with indexing competence and qualifications. As a result, once embedding these elements into the XML-based content objects in the online information environments, these objects could be accessed and retrieved easily. In regards with ranking of the content objects under study, it was found out that the Google search engine does not discriminate between the two metadata initiatives, as it 8
9 does about indexing metadata elements in both initiatives. That is, it follows a similar pattern and policy in ranking of the content objects containing these two initiatives. Also, from the answer to the 3 rd research question it was clear that the structure, whether flat or hierarchical, does not impact the quality of indexing of the content objects available in the online information environments. So is the reaction of the Google indexing software about metadata initiatives with and without language-based tags (Dublin Core against Marc21 respectively). That is, the software indexes both kinds of content objects in XML-based dynamic online information environments. Therefore, all in all, it can be concluded that both, the Dublin Core Initiative and the Marc21, are suitable for organization of XML-based content objects in dynamic online information environments. References Henshaw, Robin; Valauskas, Edward J. (2001). "Metadata as a Catalyst: Experiments with Metadata and Search Engines in the Internet Journal, First Monday". Libri, 51 (2): [online], available at: [5 Nov, 2009]. sion=1&_urlversion=0&_userid=10&md5=a853d410a866732d3f8ab5dd3217d412. [15 Nov. 2009]. Mohamed, Khaled A.f. (2006). "The impact of metadata in web resources discovering". Online Information Review, 30 (2), Quevedo- Torrero, J.U. (2004). "Improving Web Retrieval by Mining the HTML tags for Keywords and Exploring the Hyperlink Structures of Web Pages", [Abstract] doctoral Dissertation. University of Houston. [online], available at: [23 Oct, 2009]. Safari, Mahdi (2005). "Search Engines and Resource Discovery on the Web". Webology, 2 (2). [online], available at: [13 Nov. 2009]. Sokvitne, Lloyd (2000). "An Evaluation of the Effectiveness of Current Dublin Core Metadata for Retrieval". [online], available at: [13 Nov. 2009]. Taheri, mahdi (2008). "A comparative survey on the indexing quality and ranking of content objects containing Dublin Core and MARC 21 metadata elements XML based by general search engines". Thesis of library and information science, Islamic azad university of Tehran. Turner, Thomas P.; Brackbill, Lise (1998). "Rising to the Top: Evaluating the Use of the HTML Meta tag To Improve Retrieval of World Wide Web Documents through Internet Search Engines". Library Resources and Technical Services, 42 (4): [online], available at: [25 sep. 2009]. Zhang, J; Dimitroff, A (2005b). "The impact of metadata implementation on Webpage visibility in search engine result (Part II)".Information Processing and Management, 41(3), [online], available at: Zhang, Jin; Dimitroff, Alexandra (2004). "Internet search engine's response to metadata Dublin Core implementation". Journal of Information Science, 30 (4), [online], available at: [15 Nov. 2009]. Zhang, Jin; Dimitroff, Alexandra (2005a). "The impact of Webpage content characteristics on webpage visibility in search engine result (Part I)". Information Processing & 9
10 Management, 41 (3), [online], available at: [15 Nov. 2009]. doi: /j.ipm
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More information5 Choosing keywords Initially choosing keywords Frequent and rare keywords Evaluating the competition rates of search
Seo tutorial Seo tutorial Introduction to seo... 4 1. General seo information... 5 1.1 History of search engines... 5 1.2 Common search engine principles... 6 2. Internal ranking factors... 8 2.1 Web page
More informationEvaluating Web Ranking Metrics for Saudi Universities
Evaluating Web Ranking Metrics for Saudi Universities Ahmad Albhaishi 1, Heider A. Wahsheh 1, Tami Alghamdi 1 1 King Khalid University/ College of Computer Science, Computer Science Department Abha, Saudi
More informationIn the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google,
1 1.1 Introduction In the recent past, the World Wide Web has been witnessing an explosive growth. All the leading web search engines, namely, Google, Yahoo, Askjeeves, etc. are vying with each other to
More informationEXTRACTION OF RELEVANT WEB PAGES USING DATA MINING
Chapter 3 EXTRACTION OF RELEVANT WEB PAGES USING DATA MINING 3.1 INTRODUCTION Generally web pages are retrieved with the help of search engines which deploy crawlers for downloading purpose. Given a query,
More informationCHAPTER THREE INFORMATION RETRIEVAL SYSTEM
CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost
More informationA Composite Graph Model for Web Document and the MCS Technique
A Composite Graph Model for Web Document and the MCS Technique Kaushik K. Phukon Department of Computer Science, Gauhati University, Guwahati-14,Assam, India kaushikphukon@gmail.com Abstract It has been
More informationKDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW. Ana Azevedo and M.F. Santos
KDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW Ana Azevedo and M.F. Santos ABSTRACT In the last years there has been a huge growth and consolidation of the Data Mining field. Some efforts are being done
More informationDeep Web Crawling and Mining for Building Advanced Search Application
Deep Web Crawling and Mining for Building Advanced Search Application Zhigang Hua, Dan Hou, Yu Liu, Xin Sun, Yanbing Yu {hua, houdan, yuliu, xinsun, yyu}@cc.gatech.edu College of computing, Georgia Tech
More informationMATRIX BASED INDEXING TECHNIQUE FOR VIDEO DATA
Journal of Computer Science, 9 (5): 534-542, 2013 ISSN 1549-3636 2013 doi:10.3844/jcssp.2013.534.542 Published Online 9 (5) 2013 (http://www.thescipub.com/jcs.toc) MATRIX BASED INDEXING TECHNIQUE FOR VIDEO
More informationHyper Text Mark-up Language and Dublin Core metadata element set usage in websites of Iranian State Universities libraries
[Downloaded free from http://www.jehp.net on Wednesday, January 14, 215, IP: 128.199.63.23] Click here to download free Android application for this journal Original Article Hyper Text Mark-up Language
More informationOverview of Web Mining Techniques and its Application towards Web
Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous
More informationSearch Engine Optimisation Basics for Government Agencies
Search Engine Optimisation Basics for Government Agencies Prepared for State Services Commission by Catalyst IT Neil Bertram May 11, 2007 Abstract This document is intended as a guide for New Zealand government
More informationEffective On-Page Optimization for Better Ranking
Effective On-Page Optimization for Better Ranking 1 Dr. N. Yuvaraj, 2 S. Gowdham, 2 V.M. Dinesh Kumar and 2 S. Mohammed Aslam Batcha 1 Assistant Professor, KPR Institute of Engineering and Technology,
More informationFederated Searching: User Perceptions, System Design, and Library Instruction
Federated Searching: User Perceptions, System Design, and Library Instruction Rong Tang (Organizer & Presenter) Graduate School of Library and Information Science, Simmons College, 300 The Fenway, Boston,
More informationTaccumulation of the social network data has raised
International Journal of Advanced Research in Social Sciences, Environmental Studies & Technology Hard Print: 2536-6505 Online: 2536-6513 September, 2016 Vol. 2, No. 1 Review Social Network Analysis and
More informationINTRODUCTION. Chapter GENERAL
Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which
More informationSK International Journal of Multidisciplinary Research Hub Research Article / Survey Paper / Case Study Published By: SK Publisher
ISSN: 2394 3122 (Online) Volume 2, Issue 1, January 2015 Research Article / Survey Paper / Case Study Published By: SK Publisher P. Elamathi 1 M.Phil. Full Time Research Scholar Vivekanandha College of
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:
IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T
More informationis an electronic document that is both user friendly and library friendly
is an electronic document that is both user friendly and library friendly is easy to read and to navigate it has bookmarks and an interactive table-of-contents is practical to consult and arouses more
More informationAn Analysis of Image Retrieval Behavior for Metadata Type and Google Image Database
An Analysis of Image Retrieval Behavior for Metadata Type and Google Image Database Toru Fukumoto Canon Inc., JAPAN fukumoto.toru@canon.co.jp Abstract: A large number of digital images are stored on the
More informationI&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING. Andrii Donchenko
International Journal "Information Technologies and Knowledge" Vol.1 / 2007 293 I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING Andrii Donchenko Abstract: This article considers
More informationAccessibility of INGO FAST 1997 ARTVILLE, LLC. 32 Spring 2000 intelligence
Accessibility of INGO FAST 1997 ARTVILLE, LLC 32 Spring 2000 intelligence On the Web Information On the Web Steve Lawrence C. Lee Giles Search engines do not index sites equally, may not index new pages
More informationD DAVID PUBLISHING. Big Data; Definition and Challenges. 1. Introduction. Shirin Abbasi
Journal of Energy and Power Engineering 10 (2016) 405-410 doi: 10.17265/1934-8975/2016.07.004 D DAVID PUBLISHING Shirin Abbasi Computer Department, Islamic Azad University-Tehran Center Branch, Tehran
More informationProviding Interactive Site Ma ps for Web Navigation
Providing Interactive Site Ma ps for Web Navigation Wei Lai Department of Mathematics and Computing University of Southern Queensland Toowoomba, QLD 4350, Australia Jiro Tanaka Institute of Information
More informationE - INFORMATION SEARCH STRATEGY BY FACULTY OF SCIENCE DEPARTMENT, NORTH ORISSA UNIVERSITY: A CASE STUDY
E - INFORMATION SEARCH STRATEGY BY FACULTY OF SCIENCE DEPARTMENT, NORTH ORISSA UNIVERSITY: A CASE STUDY Babita Pattanaik Lecturer, Utkal University, Bhubaneswar Orissa Bibhuti Bhusan Pattanaik, Asst. Librarian
More informationA Survey On Different Text Clustering Techniques For Patent Analysis
A Survey On Different Text Clustering Techniques For Patent Analysis Abhilash Sharma Assistant Professor, CSE Department RIMT IET, Mandi Gobindgarh, Punjab, INDIA ABSTRACT Patent analysis is a management
More informationBenchmarking Google Scholar with the New Zealand PBRF research assessment exercise
Benchmarking Google Scholar with the New Zealand PBRF research assessment exercise Alastair G Smith School of Information Management Victoria University of Wellington New Zealand alastair.smith@vuw.ac.nz
More informationHow To Construct A Keyword Strategy?
Introduction The moment you think about marketing these days the first thing that pops up in your mind is to go online. Why is there a heck about marketing your business online? Why is it so drastically
More informationInstitutional Repository - Research Portal Dépôt Institutionnel - Portail de la Recherche
Institutional Repository - Research Portal Dépôt Institutionnel - Portail de la Recherche researchportal.unamur.be THESIS / THÈSE DOCTOR OF SCIENCES Methodology for automating web usability and accessibility
More informationOn-Site Analysis. Alex Gurevich
On-Site Analysis www.nettology.net Alex Gurevich Thursday, October 13, 2016 Table of Contents On-Site Analysis... 3 Load Time Testing: http://www.nettology.net/... 3 Keywords to Optimize for:... 3 Load
More information5 THINGS TO KNOW ABOUT SEO IN 2018 A Quick and Easy-To-Follow SEO E-Book
5 THINGS TO KNOW ABOUT SEO IN 2018 A Quick and Easy-To-Follow SEO E-Book It is no secret: those of us involved in the world of SEO know how far-reaching and complex the everchanging industry can be. With
More informationAN OVERVIEW OF SEARCHING AND DISCOVERING WEB BASED INFORMATION RESOURCES
Journal of Defense Resources Management No. 1 (1) / 2010 AN OVERVIEW OF SEARCHING AND DISCOVERING Cezar VASILESCU Regional Department of Defense Resources Management Studies Abstract: The Internet becomes
More informationA Web Page Segmentation Method by using Headlines to Web Contents as Separators and its Evaluations
IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.1, January 2013 1 A Web Page Segmentation Method by using Headlines to Web Contents as Separators and its Evaluations Hiroyuki
More informationSemantic Web and Electronic Information Resources Danica Radovanović
D.Radovanovic: Semantic Web and Electronic Information Resources 1, Infotheca journal 4(2003)2, p. 157-163 UDC 004.738.5:004.451.53:004.22 Semantic Web and Electronic Information Resources Danica Radovanović
More informationFinding Neighbor Communities in the Web using Inter-Site Graph
Finding Neighbor Communities in the Web using Inter-Site Graph Yasuhito Asano 1, Hiroshi Imai 2, Masashi Toyoda 3, and Masaru Kitsuregawa 3 1 Graduate School of Information Sciences, Tohoku University
More informationAutomated Online News Classification with Personalization
Automated Online News Classification with Personalization Chee-Hong Chan Aixin Sun Ee-Peng Lim Center for Advanced Information Systems, Nanyang Technological University Nanyang Avenue, Singapore, 639798
More informationLife Science Journal 2017;14(2) Optimized Web Content Mining
Optimized Web Content Mining * K. Thirugnana Sambanthan,** Dr. S.S. Dhenakaran, Professor * Research Scholar, Dept. Computer Science, Alagappa University, Karaikudi, E-mail: shivaperuman@gmail.com ** Dept.
More informationA Comparative Study of Data Mining Process Models (KDD, CRISP-DM and SEMMA)
International Journal of Innovation and Scientific Research ISSN 2351-8014 Vol. 12 No. 1 Nov. 2014, pp. 217-222 2014 Innovative Space of Scientific Research Journals http://www.ijisr.issr-journals.org/
More informationA World Wide Web-based HCI-library Designed for Interaction Studies
A World Wide Web-based HCI-library Designed for Interaction Studies Ketil Perstrup, Erik Frøkjær, Maria Konstantinovitz, Thorbjørn Konstantinovitz, Flemming S. Sørensen, Jytte Varming Department of Computing,
More informationUsing Text Elements by Context to Display Search Results in Information Retrieval Systems Model and Research results
Using Text Elements by Context to Display Search Results in Information Retrieval Systems Model and Research results Offer Drori SHAAM Information Systems The Hebrew University of Jerusalem offerd@ {shaam.gov.il,
More informationSite Design Critique Paper. i385f Special Topics in Information Architecture Instructor: Don Turnbull. Elias Tzoc
Site Design Critique Site Design Critique Paper i385f Special Topics in Information Architecture Instructor: Don Turnbull Elias Tzoc February 20, 2007 Site Design Critique - 1 Introduction Universidad
More informationComplimentary SEO Analysis & Proposal. ageinplaceofne.com. Rashima Marjara
Complimentary SEO Analysis & Proposal ageinplaceofne.com Rashima Marjara Wednesday, March 8, 2017 CONTENTS Contents... 1 Account Information... 3 Introduction... 3 Website Performance Analysis... 4 organic
More informationTable of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey.
Table of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey. Chapter 1: Organization of Recorded Information The Need to Organize The Nature of Information Organization
More informationWeb Data mining-a Research area in Web usage mining
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 13, Issue 1 (Jul. - Aug. 2013), PP 22-26 Web Data mining-a Research area in Web usage mining 1 V.S.Thiyagarajan,
More informationDiscovery services: next generation of searching scholarly information
Discovery services: next generation of searching scholarly information Article (Unspecified) Keene, Chris (2011) Discovery services: next generation of searching scholarly information. Serials, 24 (2).
More informationDescription Cross-domain Task Force Research Design Statement
Description Cross-domain Task Force Research Design Statement Revised 8 November 2004 This document outlines the research design to be followed by the Description Cross-domain Task Force (DTF) of InterPARES
More informationInvestigation of the Status of User Interface in Web-based Version of Library Software in Khuzestan Province Islamic Azad University Branches
2014, TextRoad Publication ISSN: 2090-4274 Journal of Applied Environmental and Biological Sciences www.textroad.com Investigation of the Status of User Interface in Web-based Version of Library Software
More informationijade Reporter An Intelligent Multi-agent Based Context Aware News Reporting System
ijade Reporter An Intelligent Multi-agent Based Context Aware Reporting System Eddie C.L. Chan and Raymond S.T. Lee The Department of Computing, The Hong Kong Polytechnic University, Hung Hong, Kowloon,
More informationSEO Services Sample Proposal
SEO Services Sample Proposal Scroll down to read the first part of this sample. When purchased, the complete sample is 18 pages long and is written using these Proposal Pack chapters: Cover Letter, Title
More informationNowadays data-intensive applications play a
Journal of Advances in Computer Engineering and Technology, 3(2) 2017 Data Replication-Based Scheduling in Cloud Computing Environment Bahareh Rahmati 1, Amir Masoud Rahmani 2 Received (2016-02-02) Accepted
More informationIncreasing access to OA material through metadata aggregation
Increasing access to OA material through metadata aggregation Mark Jordan Simon Fraser University SLAIS Issues in Scholarly Communications and Publishing 2008-04-02 1 We will discuss! Overview of metadata
More informationQuery Modifications Patterns During Web Searching
Bernard J. Jansen The Pennsylvania State University jjansen@ist.psu.edu Query Modifications Patterns During Web Searching Amanda Spink Queensland University of Technology ah.spink@qut.edu.au Bhuva Narayan
More informationWeb Usage Mining: A Research Area in Web Mining
Web Usage Mining: A Research Area in Web Mining Rajni Pamnani, Pramila Chawan Department of computer technology, VJTI University, Mumbai Abstract Web usage mining is a main research area in Web mining
More informationMarket Information Management in Agent-Based System: Subsystem of Information Agents
Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2006 Proceedings Americas Conference on Information Systems (AMCIS) December 2006 Market Information Management in Agent-Based System:
More informationMetadata for general purposes
H O M E E X E R C I S E S Metadata for general purposes Dublin Core Exercises and Sources A star* = newly updated or added Printer friendly version (PDF) DC creation tool to be used: Online: Template for
More informationRETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2
Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2014, 6, 1907-1911 1907 Web-Based Data Mining in System Design and Implementation Open Access Jianhu
More informationWeb Mining Evolution & Comparative Study with Data Mining
Web Mining Evolution & Comparative Study with Data Mining Anu, Assistant Professor (Resource Person) University Institute of Engineering and Technology Mahrishi Dayanand University Rohtak-124001, India
More informationBasics of SEO Published on: 20 September 2017
Published on: 20 September 2017 DISCLAIMER The data in the tutorials is supposed to be one for reference. We have made sure that maximum errors have been rectified. Inspite of that, we (ECTI and the authors)
More informationWeb Services Take Root in Banks and With Asset Managers
Strategic Planning, M. Knox, W. Andrews, C. Abrams Research Note 18 December 2003 Web Services Take Root in Banks and With Asset Managers Financial-services providers' early Web services implementations
More informationInteroperability for Digital Libraries
DRTC Workshop on Semantic Web 8 th 10 th December, 2003 DRTC, Bangalore Paper: C Interoperability for Digital Libraries Michael Shepherd Faculty of Computer Science Dalhousie University Halifax, NS, Canada
More informationThe DOI Identifier. Drexel University. From the SelectedWorks of James Gross. James Gross, Drexel University. June 4, 2012
Drexel University From the SelectedWorks of James Gross June 4, 2012 The DOI Identifier James Gross, Drexel University Available at: https://works.bepress.com/jamesgross/26/ The DOI Identifier James Gross
More informationDEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES
DEC. 1-5 Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES Monday Overview of Databases A web search engine is a large database containing information about Web pages that have been registered
More informationA web directory lists web sites by category and subcategory. Web directory entries are usually found and categorized by humans.
1 After WWW protocol was introduced in Internet in the early 1990s and the number of web servers started to grow, the first technology that appeared to be able to locate them were Internet listings, also
More informationDomain Specific Search Engine for Students
Domain Specific Search Engine for Students Domain Specific Search Engine for Students Wai Yuen Tang The Department of Computer Science City University of Hong Kong, Hong Kong wytang@cs.cityu.edu.hk Lam
More informationSEARCH SEMI-STRUCTURED DATA ON WEB
SEARCH SEMI-STRUCTURED DATA ON WEB Sabin-Corneliu Buraga 1, Teodora Rusu 2 1 Faculty of Computer Science, Al.I.Cuza University of Iaşi, Romania Berthelot Str., 16 6600 Iaşi, Romania, tel: +40 (32 201529,
More informationDeveloping an Automatic Metadata Harvesting and Generation System for a Continuing Education Repository: A Pilot Study
Developing an Automatic Metadata Harvesting and Generation System for a Continuing Education Repository: A Pilot Study Jung-Ran Park 1, Akshay Sharma 1, Houda El Mimouni 1 1 Drexel University, College
More informationWEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS
1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,
More informationComparison of FP tree and Apriori Algorithm
International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 6 (June 2014), PP.78-82 Comparison of FP tree and Apriori Algorithm Prashasti
More informationThe Design of The Integration System for OTOP Products Data Using Web Services Technology, Thailand
MACROCONFERENCE The MacroConference Proceedings The Design of The Integration System for OTOP Products Data Using Web Services Technology, Thailand Sasitorn Phimansakulwat Faculty of Business Administration,
More informationEnas El-Sayed Mohammed El-Sharawy
Enas El-Sayed Mohammed El-Sharawy Assistant Professor Computer Department Education Faculty Jubail Personal Data Nationality Egyptian Date of Birth 3 November 1984 Department Computer Science Official
More informationAdaptive and Personalized System for Semantic Web Mining
Journal of Computational Intelligence in Bioinformatics ISSN 0973-385X Volume 10, Number 1 (2017) pp. 15-22 Research Foundation http://www.rfgindia.com Adaptive and Personalized System for Semantic Web
More informationRanking Techniques in Search Engines
Ranking Techniques in Search Engines Rajat Chaudhari M.Tech Scholar Manav Rachna International University, Faridabad Charu Pujara Assistant professor, Dept. of Computer Science Manav Rachna International
More informationAlphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS. Jenn Riley IU Metadata Librarian DLP Brown Bag Series February 25, 2005
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS Jenn Riley IU Metadata Librarian DLP Brown Bag Series February 25, 2005 Descriptive metadata Enables users to find relevant materials Used
More informationSearch Engine Visibility Analysis
2018 Search Engine Visibility Analysis We do the market research, so you don t have to! Report For www.yourclientsite.com Contents Introduction... 2 Website Analysis and Recommendations... 3 Current Status
More informationAn Interactive Web based Expert System Degree Planner
An Interactive Web based Expert System Degree Planner Neil Dunstan School of Science and Technology University of New England Australia ph: +61 2 67732350 fax: +61 2 67735011 neil@cs.une.edu.au ABSTRACT
More informationImage Similarity Measurements Using Hmok- Simrank
Image Similarity Measurements Using Hmok- Simrank A.Vijay Department of computer science and Engineering Selvam College of Technology, Namakkal, Tamilnadu,india. k.jayarajan M.E (Ph.D) Assistant Professor,
More informationWebsite Name. Project Code: # SEO Recommendations Report. Version: 1.0
Website Name Project Code: #10001 Version: 1.0 DocID: SEO/site/rec Issue Date: DD-MM-YYYY Prepared By: - Owned By: Rave Infosys Reviewed By: - Approved By: - 3111 N University Dr. #604 Coral Springs FL
More informationAdaptable and Adaptive Web Information Systems. Lecture 1: Introduction
Adaptable and Adaptive Web Information Systems School of Computer Science and Information Systems Birkbeck College University of London Lecture 1: Introduction George Magoulas gmagoulas@dcs.bbk.ac.uk October
More informationTABLE OF CONTENTS CHAPTER NO. TITLE PAGENO. LIST OF TABLES LIST OF FIGURES LIST OF ABRIVATION
vi TABLE OF CONTENTS ABSTRACT LIST OF TABLES LIST OF FIGURES LIST OF ABRIVATION iii xii xiii xiv 1 INTRODUCTION 1 1.1 WEB MINING 2 1.1.1 Association Rules 2 1.1.2 Association Rule Mining 3 1.1.3 Clustering
More information3/21/2016 AN INTRODUCTION TO SEARCH ENGINE OPTIMIZATION. Search Engine Optimization (SEO) Basics for Attorneys
AN INTRODUCTION TO SEARCH ENGINE OPTIMIZATION DCBA LAW PRACTICE MANAGEMENT & TECHNOLOGY SECTION MARCH 22, 2016 Presenter: Christine P. Miller, OVC Lawyer Marketing Search Engine Optimization (SEO) Basics
More informationE-Shiksha Academy. Certified SEO Professional
E-Shiksha Academy Earn While You Learn... Certified SEO Professional Certified SEO Professional Certification Vskills certification for Search Engine Optimization assesses the candidate as per the company
More informationDevelopment of Contents Management System Based on Light-Weight Ontology
Development of Contents Management System Based on Light-Weight Ontology Kouji Kozaki, Yoshinobu Kitamura, and Riichiro Mizoguchi Abstract In the Structuring Nanotechnology Knowledge project, a material-independent
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW PAPER ON IMPLEMENTATION OF DOCUMENT ANNOTATION USING CONTENT AND QUERYING
More informationSemantic Web Mining and its application in Human Resource Management
International Journal of Computer Science & Management Studies, Vol. 11, Issue 02, August 2011 60 Semantic Web Mining and its application in Human Resource Management Ridhika Malik 1, Kunjana Vasudev 2
More information3 Publishing Technique
Publishing Tool 32 3 Publishing Technique As discussed in Chapter 2, annotations can be extracted from audio, text, and visual features. The extraction of text features from the audio layer is the approach
More informationIntroducing an Intelligent E-learning Content Constructor Engine
2011 2nd International Conference on Education and Management Technology IPEDR vol.13 (2011) (2011) IACSIT Press, Singapore Introducing an Intelligent E-learning Content Constructor Engine Hossein Keynejad
More informationTitle: Interactive data entry and validation tool: A collaboration between librarians and researchers
Proposed venue: Library Hi Tech News Title: Interactive data entry and validation tool: A collaboration between librarians and researchers Author: Abstract Purpose To share a case study process of collaboration
More informationMetadata Workshop 3 March 2006 Part 1
Metadata Workshop 3 March 2006 Part 1 Metadata overview and guidelines Amelia Breytenbach Ria Groenewald What metadata is Overview Types of metadata and their importance How metadata is stored, what metadata
More informationYou got a website. Now what?
You got a website I got a website! Now what? Adriana Kuehnel Nov.2017 The majority of the traffic to your website will come through a search engine. Need to know: Best practices so ensure your information
More informationQuagmire or Goldmine?
The World-Wide Wide Web: Quagmire or Goldmine? Oren Etzioni [Comm. of the ACM, Nov 1996] Presentation Credits: Shabnam Sobti 30 - OCT - 2002 WWW - Quagmire or Goldmine? 1 Agenda Prelude: The Internet Story
More informationDesign and Implementation of Search Engine Using Vector Space Model for Personalized Search
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 1, January 2014,
More informationRevealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization
Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Katsuya Masuda *, Makoto Tanji **, and Hideki Mima *** Abstract This study proposes a framework to access to the
More informationManagement Science Letters
Management Science Letters 4 (2014) 111 116 Contents lists available at GrowingScience Management Science Letters homepage: www.growingscience.com/msl A new method for converting extended version of petri
More informationWeb Crawling. Jitali Patel 1, Hardik Jethva 2 Dept. of Computer Science and Engineering, Nirma University, Ahmedabad, Gujarat, India
Web Crawling Jitali Patel 1, Hardik Jethva 2 Dept. of Computer Science and Engineering, Nirma University, Ahmedabad, Gujarat, India - 382 481. Abstract- A web crawler is a relatively simple automated program
More informationVIDEO SEARCHING AND BROWSING USING VIEWFINDER
VIDEO SEARCHING AND BROWSING USING VIEWFINDER By Dan E. Albertson Dr. Javed Mostafa John Fieber Ph. D. Student Associate Professor Ph. D. Candidate Information Science Information Science Information Science
More informationAutomatic New Topic Identification in Search Engine Transaction Log Using Goal Programming
Proceedings of the 2012 International Conference on Industrial Engineering and Operations Management Istanbul, Turkey, July 3 6, 2012 Automatic New Topic Identification in Search Engine Transaction Log
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON WEB CONTENT MINING DEVEN KENE 1, DR. PRADEEP K. BUTEY 2 1 Research
More informationOptimization of Query Processing in XML Document Using Association and Path Based Indexing
Optimization of Query Processing in XML Document Using Association and Path Based Indexing D.Karthiga 1, S.Gunasekaran 2 Student,Dept. of CSE, V.S.B Engineering College, TamilNadu, India 1 Assistant Professor,Dept.
More informationCharacterizing Home Pages 1
Characterizing Home Pages 1 Xubin He and Qing Yang Dept. of Electrical and Computer Engineering University of Rhode Island Kingston, RI 881, USA Abstract Home pages are very important for any successful
More information