Supporting Constructivist Learning in a Multimedia Presentation System

Size: px
Start display at page:

Download "Supporting Constructivist Learning in a Multimedia Presentation System"

Transcription

1 Supporting Constructivist Learning in a Multimedia Presentation System Dula Kumela 1, Kenneth Watts 2, and W. Richards Adrion 3 Abstract - The Research in Presentation Production for Learning Electronically (RIPPLES) group in the Department of Computer Science at UMASS Amherst have developed a course delivery system named Multimedia Asynchronous Networked Individualized Courseware (MANIC). MANIC uses the approach of record and playback. While record and playback technologies can be very effective in supporting a constructivist mode of instructional delivery, the technology is not inherently constructivist. In support of a more constructivist mode of instruction, we have implemented advanced indexing and search features in MANIC that makes use of ranking and relevance, and a query expansion technique to generate queries and conduct search over the World Wide Web (WWW) using Google. In this paper we describe initial experiments conducted, and our plans for additional assessment and enhancement of the search mechanism. deliver lectures and course materials in and outside of the classroom. The RIPPLES Project delivers lectures using several related technologies. Most are based on the MANIC framework and include streaming and CD/DVD modes of courseware delivery. Current RIPPLES technologies include a number of features important to creating a constructivist-learning environment. Figure 1 shows how the indexing and search mechanisms within the CD-MANIC courseware can be linked with an on-line text. Indexes are interlinked so that students can search the text or the CD-MANIC presentation or use either the text or CD-MANIC index to reach linked points in the text and associated lecture. The Learner Logger [7] is embedded within the CD-MANIC browser. Data are collected Index Terms - Constructivist Learning, Query expansion, Precision, Ranking, Recall, Relevance and Search. INTRODUCTION Constructivist-oriented instructional models, including active learning, peer learning, and cooperative learning, are effective in computer science education, for example [9,10]. Constructivism holds that learners define their own meaning to the world by constructing understanding through experience. For the purpose of this paper, we view constructivist learning as an environment where learners, typically in groups, explore (with guidance from an instructor) the learning environment and construct meaning based on their (shared) learning experiences and active investigation of provided materials. A constructivist-learning environment [11], whether physical or technology-based, should be active (learners are engaged), constructive (learners build on prior knowledge, integrating with new ideas and experiences to construct meaning), intentional (goal-directed), complex (learners face ill-structured and complicated problems), contextual (problems are situated in a realistic context), conversational (learners interact with each other) and reflective (learners report on progress and process). The RIPPLES Project investigates how to most effectively use the WWW and CD/DVD-ROM technology to FIGURE 1 CD MANIC EXAMPLE continuously and uploaded to a server in the background during student sessions, and allow us to track all student behavior including: format choices (video, audio, slides); relative size and positioning on the screen time spent on each step of the presentation, external links followed; menu choices; responses to queries; quiz scores; and student feedback and evaluation forms. Most record and playback approaches, such as the original RIPPLES/MANIC technology, produce courseware that is largely passive, typically only allowing some form of 1 Dula Kumela, Research Assistant, University of Massachusetts Amherst, RIPPLES laboratory, dkumela@cs.umass.edu 2 Kenneth Watts, Senior Software Engineer, University of Massachusetts Amherst, RIPPLES laboratory, watts@cs.umass.edu 3 W. Richards Adrion, Professor of Computer science, University of Massachusetts Amherst, RIPPLES laboratory, adrion@cs.umass.edu T1H-18

2 simple search over the text in the recorded lectures, and they generally have limited ability to support learner navigation, a key to knowledge discovery and construction. Our opportunity to apply constructivist pedagogies depends on creating a constructivist-learning environment in and outside the classroom. To do this, we had to go well beyond record-andplayback technologies and the extended search mechanism described here is an important first step. ADVANCED SEARCH IN CD-MANIC In the original RIPPLES/MANIC courseware, learners can search over the text in lecture slides and, if available, the electronic version of the course text book using simple Boolean combinations of keywords [1]. An index to the matched slides/text is presented to the learner as a sequentially ordered list. These results are not extremely useful to the user since the search does not rank the result based on relevance to the topic the user is querying. Search is limited within a single course, rather than allowing searches over multiple courses and over the great wealth of information available in the internet. As the standard index (slides, text) has improved, Learner Logger data show that users seldom employ search. To support constructivist learning, learners need a mechanism to search for a given topic or subject from material available within a given course, across related courses, within texts and other reference material, and throughout the WWW. To enable broader exploration and knowledge construction, we replaced CD-MANIC brute force searching with a more advanced searching technique that makes use of ranking and relevance. The updated system generates matching documents (e.g., relevant lecture segments) from within the locally available material. A second-level query, derived from content of the matching documents and the initial user query using query expansion techniques, is conducted over the WWW and a database of reference material using Google TM. The returned local and WWW links are filtered by relevance to learner interests. MANIC COURSE DEVELOPER MANIC USER MANIC INDEXER MANIC BROWSER LOGGER Log Database Server Index databse JNI Connector Query Expansion Module FIGURE 2 ARCHITECTURE OF MANIC SEARCH LUCENE API GOOGLE API TECHNOLOGY ARCHITECTURE The new RIPPLES/MANIC search mechanism was developed using open source systems and frameworks and free online services, integrated with software developed by the RIPPLES Group. RIPPLES component software includes a query expansion module, a user interface module for indexing, an extension to the Learner Logger to capture user interaction with the system, and a Java Native Interface (JNI) component that provides the interface between open source code implemented in Java and the existing C++ implementation of CD-MANIC. To index and search lecture slides (and online texts), we used the open source Lucene search Application Program Interface (API) available from the Apache Jakarta project [2]. The decision to use Lucene was made based on the fact that it is an open source API that we can modify to meet our needs. Lucene also provides most of the features we were looking for in a search engine such as ranking and relevance when performing a search over indexed material. The Google TM search API [3] is used to search for materials over the WWW. The overall architecture of the new CD-MANIC search is depicted in Figure 2. DESCRIPTION OF COMPONENTS In this section we will give description of the components that make up the advanced search feature of CD-MANIC. The discussion will include description and usage of the following: Lucene API Google API Query expansion module JNI integration with CD-MANIC Learner Logger interface Lucene API Lucene is a high-performance, full-featured text search engine written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially crossplatform. Lucene was started by Doug Cutting 4 as an independent project in September 2001 it became an official Jakarta project. As most of the Jakarta projects, Lucene is also provided as an open source library. Lucene provides an easy to use API for indexing a set of documents and performing searches over the created index. Indexing is the process of creating a special database (an index ) that contains a compiled version of documents, optimized for quick lookup of a list of documents that contain certain words or terms. By default, Lucene stores the index as a set of files in a file system. Lucene provides the flexibility to implement other storage methods such as nonresident inmemory storage, or mapping of Lucene data to any third party relational database. Lucene API provides elaborate control over the information stored in the index for each document and how this information is used during indexing and searching. On 4 Doug Cutting, originator of the Lucene search, cutting@ apache.org T1H-19

3 one extreme, it is possible to store for each document just its location (e.g. URL or file path) and index the content of the document as a monolithic piece of text. On the other extreme, it is possible to store the entire document as well as various attributes such as Author, Title, and Date and perform searches that consider these attribute for matching and ranking. One of the major advantages of Lucene over other currently available open source search engines is its ability to index any file or document type. Lucene supports an API that a developer can use to index a file as far as it meets a given standard without being specific to any file type and gives the responsibility of reading in and parsing the document to be indexed to the developer using the API. A given document can be indexed using Lucene as long as the developer can provide a parser for the document based on the Lucene standard. Searching in Lucene is the operation of locating a subset of the documents that contains desired content or has attributes that match some specification. The input for a search operation is a query (could be a term, a set of terms or Boolean combination of terms) that specifies a criteria for selecting the documents and its output is a list of documents or hits that matched that criteria. The hit list is typically ordered by some measure of relevancy (called ranking or scoring ) and may contain only a subset of the set of documents that matched the query (typically the documents that have the highest rank or score). The search operation is performed on the index. The index database is optimized for locating documents that contains certain words or terms quickly. Rank or score of documents for a given search query is calculated using (1). score_d = sum_t (tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t * boost_t) * coord_q_d (1) VALUE score_d sum_t tf_q TABLE I DESCRIPTION OF VALUES IN (1) DESCRIPTION Score for document d Sum for all terms t The square root of the frequency of t in the query idf_t log (numdocs/docfreq_t+1) + 1 norm_q sqrt(sum_t((tf_q*idf_t)^2)) tf_d The square root of the frequency of t in d norm_d_t Square root of number of tokens in d in the same field as t boost_t The user-specified boost for term t coord_q_d Number of terms in both query and document / number of terms in query numdocs Number of documents in index docfreq_t Number of documents containing term t Lucene is a library that provides an API to index and search a set of documents; it is not a standalone application or does not include a user interface. For this reason, we had to develop an interface which can be used to select and index a user select a set of documents from a local file system and index them. It also has a feature to indicate the indexing progress and show the listing of documents that have been indexed. We currently support indexing for text and html documents which meets our current requirement. Support for other file types can be added with minimal effort as needed. The interface also lets a user load a given index database and perform search operation. This helps to test a given index database after indexing a set of documents. FIGURE 3 CD-MANIC INDEXER USER INTERFACE Google TM API Google TM is a widely used search engine. Its popularity has increased over the years. The developers at Google TM provide an API that allows developers to use the Google TM search engine from within their own application. This service is provided as a Web service that can be accessed over the Internet. The Google TM API provides functions to support three different request submission types: search, cache and spelling. Search requests include a query string and a set of parameters to the Google TM Web API service and receive in return a set of search results derived from Google TM 's index of over 2 billion Web pages. Cache requests send a URL and receive in return the contents of the URL when Google TM 's crawlers last visited the page (if available). Spelling requests receive a suggested spell correction for the query (if available). Spell corrections mimic the same behavior as found on the Google TM Web site and are subject to the same query string limitations as any other search request. The input string is limited to 2048 bytes and 10 individual words. The return type for spelling requests is a text string. In our implementation of the RIPPLES/MANIC search we used only the Google TM search request feature. At the current stage of our product, we have not found use for the other two features provided in the API. When passing the search query that was supplied to CD-MANIC search directly to the Google TM search API, some of the results returned from Google TM were not relevant to the subject the user was querying. To improve the amount of relevant documents returned from the search we implemented a query expansion module described in the next section. documents. The first version of the user interface module lets T1H-20

4 Query Expansion Module The Google TM search engine returns some documents that are not relevant when supplied with the query from the user of the system. This could be because Google TM keeps track of various documents in its database that have the terms supplied by the user query but not related to the subject the user is searching on. It was necessary that we find a mechanism that would solve this problem. We implemented a query expansion module that makes use of locally available documents to generate a more descriptive query that generates more relevant results from Google TM. Automatic query expansion has long been suggested as a technique for dealing with the fundamental issue of word mismatch in information retrieval [4]. If a relevant document does not contain the terms that are in the query, then that document will not be retrieved. The aim of query expansion is to reduce this query/document mismatch by expanding the query using words or phrases with similar meaning or some other statistical relation to the set of relevant documents [5]. A number of approaches to query expansion have been suggested, studied and, more recently, attention has been given to techniques that analyze the corpus to discover word relationships (global techniques) and those that analyze documents retrieved by the initial query (local feedback). It is widely accepted that the local feedback technique is more effective than the global techniques. The approach we took for the implementation of our query expansion module was local feedback analysis. The general concept of local feedback dates back at least to a 1977 paper by Attar and Fraenkel [6]. This technique fit well with the problem we were facing. Local feedback analysis involves getting the content of the top ranked documents and generating an expanded query. This approach works best if the top raked documents are relevant, and that is the assumption when using the technique. In our implementation the expanded query is generated using the following procedure. The user supplied query is passed to the Lucene API to generate the matching lecture slides and the top 5 documents are selected. The content of the 5 documents is then used to create a list that has a pairing of a word and its rank. The ranking of words is assigned based on weighted frequency of the word in the content of the top ranked documents and the query supplied by the user. The formula used is given in (2). Rank = 8q + 4d (2) In reference to (2), q is the number of times the word appears in the query and d is the average number of times it occurs in the top documents. Once the words are ranked, we take the top 3 ranked words and combine it with the initial user query to generate the expanded query. This approach falls a little short of what we plan to achieve since it does not take into consideration words like the and a that could create noise in the result of word ranking. There are various techniques that could be used to disregard noise creating words. One approach is to create a stop list and not consider words that are in the list during the query expansion process. The other approach is to employee a mechanism that would give lower weight to common words. Our approach to solving this problem was to use our larger collection of lecture slides to figure out common words and give them lower weight. For each word that is being considered we calculated the inverse document frequency (IDF) value using (3). IDF (w) = log (N/n) (3) In reference to (3), w is the word being measured, N is the number of documents in the collection and n equals the number of documents that contain the word. Once the IDF value of a word is calculated, then it is multiplied by the weight of word to get the value used to rank the word. Essentially, this is the same as calculating TF 5 * IDF to rank words which in turn give common words lower weight. It means that the ranking of terms is done using (4). Rank = (8q + 4d) * log (N/n) (4) ORIGINAL QUERY locking file system Remote objects TABLE 2 SAMPLE OF QUERY EXPANSION EXPANDED QUERY locking strict two-phase file system directory handle nfs remote objects server pointers system-wide Learner Logger interface In the design of CD-MANIC, a great emphasis was given to capturing student interaction with the system. The idea behind this is to analyze the student interaction logs and improve the system by creating a student model that represents usage of the system. The Learner Logger interface was developed to meet this demand. Since CD-MANIC is a standalone system that executes on the student machine, the mechanism of capturing student interaction is implemented as a two-step process. First, the ability to log the student s behavior within the context of the application was built into CD-MANIC through the logger interface. Actions such as clicking on the slide index, resizing the application, searching, and selecting items from the application menu are recorded in the log file, along with a date and time stamp by the logger interface. However, no personal information about the student is ever captured or recorded in the log. The logger interface records and locally stores the log data on the client machine in a text file which is a series of variable length comma separated records. The second half of the problem is sending the data to a place where it can be retrieved and reviewed. This can be initiated either by the student through a menu item or automatically by the application. In the case of automatic update, the system will upload the log data of the student to the CD-MANIC log server in a given time interval. Students 5 TF = Term Frequency T1H-21

5 are uniquely identified using the serial number of their local hard drive. The biggest advantage of having the logger interface is the ability it provides us to study student interaction with the system and improve the system accordingly. It also allows us to capture student interaction with the system even if the student is not connected to the internet when using the system, as long as the user connects at some point. We have noticed some disadvantages to the approach of logging we are using. First, there is the potential that log data may never be retrieved from any students. For example, a student may only use the application when they are not connected to the network and so their logged data will not be sent to the server. Second, it is possible for a single student to run the application on more than one machine. Since the logger interface does not attempt to identify a single student based on personal information and the uniqueness of a student is identified by the uniqueness of their hardware, a single student has the potential to appear as multiple students in the log data. Despite these disadvantages, it seems that they do not significantly impact the data collection process due to the amount and diversity of the data that has been received by the log server [7]. JNI Connector One of the challenges we faced while integrating CD-MANIC, Lucene API and Google API was the fact that the different modules were implemented in different programming languages. CD-MANIC was built using C++ while the Google TM API and Lucene were implemented in Java. We were able to find an open source implementation of Lucene in C++, but it was not stable enough to be used in a production environment. To solve this problem, we implemented a module using the Java JNI technology [8] that serves as a Connector between the C++ implementation of CD-MANIC and the Java implementation of Lucene and the Google TM API. The JNI connector provides a messaging service to the different modules of the system by passing calls from one module to the other and by making the necessary data conversion so that the modules interact seamlessly. Refer to Figure 1 to see how this interaction occurs in the system. EXPERIMENTS To evaluate the work we have done, we performed an experiment on the Lucene API and on the effectiveness of our query expansion module on the Google TM search. Our experiment emphasized precision and recall values. Precision and Recall are calculated using (5) and (6). Recall= A / (A + C) (5) Precision =A / (A+B) (6) where A = Relevant retrieved B = Not relevant retrieved C = Relevant not retrieved While evaluating the Lucene API we discovered that as precision value goes down the recall value goes up. The observation is consistent to the general belief that recall and precision are inversely related. It is possible to have a recall rate of 1 by returning every document that has the query terms, but that will reduce the precision rate. In any implementation of a search engine, there is a tradeoff between precision and recall rate. Refer to figure 4 for the Lucene experimental results Our evaluation of the query expansion has interesting results. In general, it helped to increase the precision rate on the search done over the internet to some level. The interesting observation was that if the local search results that are used to expand the query are not relevant to the subject matter, the expanded query results are not useful. The success of the query expansion is dependant on the relevance of the local search results to the subject in consideration. In cases where the local search result is relevant, the expanded query does a great job of refining the search result we get from Google TM. A search for the query term Java with a good local search result generated various results that point to documents that talk about Java RMI. The documents that were retrieved with the local search extensively talked about Java RMI. Query expansion improves the search result we get from the Google TM search. The other interesting observation was the fact that if we keep increasing the number of terms by which the query is expanded, it will have a negative impact on the precision rate. For the case of our query expansion module, we have found that expanding the query by a maximum of 3 terms gives us the best precision rate result and going above that threshold reduces the rate. Figure 5 depicts this fact. T1H-22 Precision Precision Recall FIGURE 4 A GRAPH OF PRECISION VS. RECALL FOR LOCAL (LUCENE) SEARCH # expanded terms FIGURE 5 A GRAPH OF PRECISION VS. NUMBER OF EXPANDED TERMS

6 Figure 6 shows the runtime evaluation of the Google TM search API. The amount of time it takes to generate search results from the Google TM API increases as the length of the query increases as expected. runtime in second # of terms in query FIGURE 6 A GRAPH OF RUNTIME VS. NUMBER OF TERMS IN QUERY We have not completed evaluation of student interaction with the system since data collection began for Spring 2004 courses. We are receiving log data on a daily basis and expect to have sufficient data for evaluation soon. At this point, the collected data is not large enough to give us a complete picture of how the search feature is being used. It is our assumption that we have not received enough log information because most CD-MANIC users use the system offline and the collected data is not being sent to the log server. Based on a survey we conducted most of the users like the updated search feature and their learning experience improved because of it. Our survey also pointed out that some use CD-MANIC at work and their company is protected by a firewall. This prohibits CD-MANIC from sending the log data to the log server. CONCLUSIONS AND FUTURE WORK We described the improved search feature of CD-MANIC and the results we obtained. We showed that the query expansion mechanism we implemented for searching Google TM helped in refining the search to list relevant documents. We believe our work in adding this advanced search feature to CD-MANIC will create a constructivist-learning environment. We are continuing to improve all versions of MANIC by adding new features: Incorporating audio/video indexing to MANIC to access information that is present in audio/video recordings of lectures [12]. Adding ability for MANIC to suggest different queries to the learner when the original query returns no or few results; for example, using spell checkers and technical thesauri to suggest spelling corrections or alternatives. Combining quiz/test results with Learner Logger data to generate queries based on a model of student comprehension; for example, data on the time spent navigating and searching for certain topics can be combined with test results to lead students to supplementary material from within the course, other courses or the WWW. ACKNOWLEDGMENTS We thank James Allan and Andrew McCallum of the Center for Intelligent Information Retrieval in the Department of Computer Science at UMASS Amherst for their help during the effort of developing an improved search tool for CD- MANIC. This work was partially supported by the National Science Foundation under EIA REFERENCES [1] Thampuran, R, S, "A Multimedia Course Delivery System Combining Web and CD/DVD-Based Technologies", Thesis publication [2] [3] Google API, " [4] Tianjin, P. R, Probabilistic Query Expansion Using Query Logs ", WWW 2002 Honolulu [5] Jinxi, Xu, "Query Expansion using local and global document analysis", [6] Attar, R, "Local feedback in Full-text retrieval systems", Journal of the Association of computing Machinery [7] Burleson, W, "An Empirical Study of Student Interaction with CDbased Multimedia Courseware ", Proc of American Society of Engineering Education, 2002 [8] [9] J. D. Chase and Edward G. Okie, Combining Cooperative Learning And Peer Instruction In Introductory Computer Science, Proceedings of SIGCSE 2000, 2000 [10] Said Hadjerrouit, A Constructivist Approach to Object-Oriented Design and Programming, Proceedings of the 4th Annual Conference on Integrating Technology into Computer Science Education ITiCSE 99, 1999 [11] D. Jonassen, Designing Constructivist Learning Environments, in Instructional Theories and Models, C.M Reigeluth, Ed., 1998 [12] T1H-23

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

Search Engine Architecture II

Search Engine Architecture II Search Engine Architecture II Primary Goals of Search Engines Effectiveness (quality): to retrieve the most relevant set of documents for a query Process text and store text statistics to improve relevance

More information

Fall CS646: Information Retrieval. Lecture 2 - Introduction to Search Result Ranking. Jiepu Jiang University of Massachusetts Amherst 2016/09/12

Fall CS646: Information Retrieval. Lecture 2 - Introduction to Search Result Ranking. Jiepu Jiang University of Massachusetts Amherst 2016/09/12 Fall 2016 CS646: Information Retrieval Lecture 2 - Introduction to Search Result Ranking Jiepu Jiang University of Massachusetts Amherst 2016/09/12 More course information Programming Prerequisites Proficiency

More information

Relevancy Workbench Module. 1.0 Documentation

Relevancy Workbench Module. 1.0 Documentation Relevancy Workbench Module 1.0 Documentation Created: Table of Contents Installing the Relevancy Workbench Module 4 System Requirements 4 Standalone Relevancy Workbench 4 Deploy to a Web Container 4 Relevancy

More information

EVALUATION OF THE USABILITY OF EDUCATIONAL WEB MEDIA: A CASE STUDY OF GROU.PS

EVALUATION OF THE USABILITY OF EDUCATIONAL WEB MEDIA: A CASE STUDY OF GROU.PS EVALUATION OF THE USABILITY OF EDUCATIONAL WEB MEDIA: A CASE STUDY OF GROU.PS Turgay Baş, Hakan Tüzün Hacettepe University (TURKEY) turgaybas@hacettepe.edu.tr, htuzun@hacettepe.edu.tr Abstract In this

More information

Chapter 2. Architecture of a Search Engine

Chapter 2. Architecture of a Search Engine Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them

More information

Easy Ed: An Integration of Technologies for Multimedia Education 1

Easy Ed: An Integration of Technologies for Multimedia Education 1 Easy Ed: An Integration of Technologies for Multimedia Education 1 G. Ahanger and T.D.C. Little Multimedia Communications Laboratory Department of Electrical and Computer Engineering Boston University,

More information

Web-based File Upload and Download System

Web-based File Upload and Download System COMP4905 Honor Project Web-based File Upload and Download System Author: Yongmei Liu Student number: 100292721 Supervisor: Dr. Tony White 1 Abstract This project gives solutions of how to upload documents

More information

Blackboard 5 Level One Student Manual

Blackboard 5 Level One Student Manual Blackboard 5 Level One Student Manual Blackboard, Inc. 1899 L Street NW 5 th Floor Washington DC 20036 Copyright 2000 by Blackboard Inc. All rights reserved. No part of the contents of this manual may

More information

Development of Search Engines using Lucene: An Experience

Development of Search Engines using Lucene: An Experience Available online at www.sciencedirect.com Procedia Social and Behavioral Sciences 18 (2011) 282 286 Kongres Pengajaran dan Pembelajaran UKM, 2010 Development of Search Engines using Lucene: An Experience

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

DRACULA. CSM Turner Connor Taylor, Trevor Worth June 18th, 2015

DRACULA. CSM Turner Connor Taylor, Trevor Worth June 18th, 2015 DRACULA CSM Turner Connor Taylor, Trevor Worth June 18th, 2015 Acknowledgments Support for this work was provided by the National Science Foundation Award No. CMMI-1304383 and CMMI-1234859. Any opinions,

More information

CS377: Database Systems Text data and information. Li Xiong Department of Mathematics and Computer Science Emory University

CS377: Database Systems Text data and information. Li Xiong Department of Mathematics and Computer Science Emory University CS377: Database Systems Text data and information retrieval Li Xiong Department of Mathematics and Computer Science Emory University Outline Information Retrieval (IR) Concepts Text Preprocessing Inverted

More information

Chapter 27 Introduction to Information Retrieval and Web Search

Chapter 27 Introduction to Information Retrieval and Web Search Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval

More information

Open Source Search. Andreas Pesenhofer. max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria

Open Source Search. Andreas Pesenhofer. max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria Open Source Search Andreas Pesenhofer max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria max.recall information systems max.recall is a software and consulting company enabling

More information

International Journal of Scientific & Engineering Research Volume 2, Issue 12, December ISSN Web Search Engine

International Journal of Scientific & Engineering Research Volume 2, Issue 12, December ISSN Web Search Engine International Journal of Scientific & Engineering Research Volume 2, Issue 12, December-2011 1 Web Search Engine G.Hanumantha Rao*, G.NarenderΨ, B.Srinivasa Rao+, M.Srilatha* Abstract This paper explains

More information

Chapter 6: Information Retrieval and Web Search. An introduction

Chapter 6: Information Retrieval and Web Search. An introduction Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods

More information

VK Multimedia Information Systems

VK Multimedia Information Systems VK Multimedia Information Systems Mathias Lux, mlux@itec.uni-klu.ac.at This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Results Exercise 01 Exercise 02 Retrieval

More information

Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data

Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data American Journal of Applied Sciences (): -, ISSN -99 Science Publications Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data Ibrahiem M.M. El Emary and Ja'far

More information

THE WEB SEARCH ENGINE

THE WEB SEARCH ENGINE International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR) Vol.1, Issue 2 Dec 2011 54-60 TJPRC Pvt. Ltd., THE WEB SEARCH ENGINE Mr.G. HANUMANTHA RAO hanu.abc@gmail.com

More information

Custom IDF weights for boosting the relevancy of retrieved documents in textual retrieval

Custom IDF weights for boosting the relevancy of retrieved documents in textual retrieval Annals of the University of Craiova, Mathematics and Computer Science Series Volume 44(2), 2017, Pages 238 248 ISSN: 1223-6934 Custom IDF weights for boosting the relevancy of retrieved documents in textual

More information

Extensible and Dynamic Data Structure Viewers in Java

Extensible and Dynamic Data Structure Viewers in Java Extensible and Dynamic Data Structure Viewers in Java Jhilmil Jain Computer Science and Software Engineering Department, Auburn University, Auburn AL Email: jainjhi@auburn.edu Problem & motivation Many

More information

DATA MINING - 1DL105, 1DL111

DATA MINING - 1DL105, 1DL111 1 DATA MINING - 1DL105, 1DL111 Fall 2007 An introductory class in data mining http://user.it.uu.se/~udbl/dut-ht2007/ alt. http://www.it.uu.se/edu/course/homepage/infoutv/ht07 Kjell Orsborn Uppsala Database

More information

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India

Shrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent

More information

LucidWorks: Searching with curl October 1, 2012

LucidWorks: Searching with curl October 1, 2012 LucidWorks: Searching with curl October 1, 2012 1. Module name: LucidWorks: Searching with curl 2. Scope: Utilizing curl and the Query admin to search documents 3. Learning objectives Students will be

More information

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS 1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,

More information

A Comparison of Software-Based Distance Learning Solutions In focus Interactive Learning Systems

A Comparison of Software-Based Distance Learning Solutions In focus Interactive Learning Systems A Comparison of Software-Based Distance Learning Solutions In focus Interactive Learning Systems Cheryl Choy Introduction Within a short span of time, distance education has set a footprint in the world

More information

Introduction. Design for Strength and Endurance- Introduction. Screen Titles

Introduction. Design for Strength and Endurance- Introduction. Screen Titles Introduction Screen Titles Course Objective and Overview Content and Format Proposed Study Guide Installation and Operation Introduction - 1 - C.F. Zorowski 2002 Introduction - 2 - C.F. Zorowski 2002 Course

More information

A BFS-BASED SIMILAR CONFERENCE RETRIEVAL FRAMEWORK

A BFS-BASED SIMILAR CONFERENCE RETRIEVAL FRAMEWORK A BFS-BASED SIMILAR CONFERENCE RETRIEVAL FRAMEWORK Qing Guo 1, 2 1 Nanyang Technological University, Singapore 2 SAP Innovation Center Network,Singapore ABSTRACT Literature review is part of scientific

More information

A short introduction to the development and evaluation of Indexing systems

A short introduction to the development and evaluation of Indexing systems A short introduction to the development and evaluation of Indexing systems Danilo Croce croce@info.uniroma2.it Master of Big Data in Business SMARS LAB 3 June 2016 Outline An introduction to Lucene Main

More information

Using Algorithm Visualization to Improve Students' Understanding of Parameter Passing Methods

Using Algorithm Visualization to Improve Students' Understanding of Parameter Passing Methods Using Algorithm Visualization to Improve Students' Understanding of Parameter Passing Methods Jessica M. Gowey Computer Science Department University of Wisconsin Oshkosh goweyj98@uwosh.edu Orjola Kajo

More information

Azureus Plugin for Facebook Integration

Azureus Plugin for Facebook Integration Azureus Plugin for Facebook Integration Mike House-Vording, 100300955 mhvordin@connect.carleton.ca 1. Introduction This project seeks to integrate two major trends currently taking place on the internet:

More information

Boolean Model. Hongning Wang

Boolean Model. Hongning Wang Boolean Model Hongning Wang CS@UVa Abstraction of search engine architecture Indexed corpus Crawler Ranking procedure Doc Analyzer Doc Representation Query Rep Feedback (Query) Evaluation User Indexer

More information

Full-Text Indexing For Heritrix

Full-Text Indexing For Heritrix Full-Text Indexing For Heritrix Project Advisor: Dr. Chris Pollett Committee Members: Dr. Mark Stamp Dr. Jeffrey Smith Darshan Karia CS298 Master s Project Writing 1 2 Agenda Introduction Heritrix Design

More information

Android Basics Nanodegree Syllabus

Android Basics Nanodegree Syllabus Android Basics Nanodegree Syllabus Before You Start This is an entry-level, single term Nanodegree program with no prior programming experience required. Support Options We are here to support you every

More information

In the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google,

In the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google, 1 1.1 Introduction In the recent past, the World Wide Web has been witnessing an explosive growth. All the leading web search engines, namely, Google, Yahoo, Askjeeves, etc. are vying with each other to

More information

Tennessee. Trade & Industrial Course Web Page Design II - Site Designer Standards. A Guide to Web Development Using Adobe Dreamweaver CS3 2009

Tennessee. Trade & Industrial Course Web Page Design II - Site Designer Standards. A Guide to Web Development Using Adobe Dreamweaver CS3 2009 Tennessee Trade & Industrial Course 655745 Web Page Design II - Site Designer Standards A Guide to Web Development Using Adobe Dreamweaver CS3 2009 ation Key SE Student Edition LE Learning Expectation

More information

Tilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval

Tilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval Tilburg University Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval Publication date: 2006 Link to publication Citation for published

More information

CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL

CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL STUDIA UNIV. BABEŞ BOLYAI, INFORMATICA, Volume LVII, Number 4, 2012 CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL IOAN BADARINZA AND ADRIAN STERCA Abstract. In this paper

More information

Information Retrieval

Information Retrieval Natural Language Processing SoSe 2014 Information Retrieval Dr. Mariana Neves June 18th, 2014 (based on the slides of Dr. Saeedeh Momtazi) Outline Introduction Indexing Block 2 Document Crawling Text Processing

More information

A Document-centered Approach to a Natural Language Music Search Engine

A Document-centered Approach to a Natural Language Music Search Engine A Document-centered Approach to a Natural Language Music Search Engine Peter Knees, Tim Pohle, Markus Schedl, Dominik Schnitzer, and Klaus Seyerlehner Dept. of Computational Perception, Johannes Kepler

More information

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND 41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia

More information

Domain Specific Search Engine for Students

Domain Specific Search Engine for Students Domain Specific Search Engine for Students Domain Specific Search Engine for Students Wai Yuen Tang The Department of Computer Science City University of Hong Kong, Hong Kong wytang@cs.cityu.edu.hk Lam

More information

How to Use Panopto Video Series Transcript

How to Use Panopto Video Series Transcript How to Use Panopto Video Series Transcript Video 1: Introduction Hello new students and welcome to Auburn Agriculture Online at Auburn University! In this video, I m going to briefly show you how to use

More information

Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha

Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha 2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha Physics Institute,

More information

Notes: Notes: Primo Ranking Customization

Notes: Notes: Primo Ranking Customization Primo Ranking Customization Hello, and welcome to today s lesson entitled Ranking Customization in Primo. Like most search engines, Primo aims to present results in descending order of relevance, with

More information

Speed and Accuracy using Four Boolean Query Systems

Speed and Accuracy using Four Boolean Query Systems From:MAICS-99 Proceedings. Copyright 1999, AAAI (www.aaai.org). All rights reserved. Speed and Accuracy using Four Boolean Query Systems Michael Chui Computer Science Department and Cognitive Science Program

More information

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and

More information

INTRODUCTION. Chapter GENERAL

INTRODUCTION. Chapter GENERAL Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which

More information

Ranking Algorithms For Digital Forensic String Search Hits

Ranking Algorithms For Digital Forensic String Search Hits DIGITAL FORENSIC RESEARCH CONFERENCE Ranking Algorithms For Digital Forensic String Search Hits By Nicole Beebe and Lishu Liu Presented At The Digital Forensic Research Conference DFRWS 2014 USA Denver,

More information

Site Content Analyzer for Analysis of Web Contents and Keyword Density

Site Content Analyzer for Analysis of Web Contents and Keyword Density Site Content Analyzer for Analysis of Web Contents and Keyword Density Bharat Bhushan Asstt. Professor, Government National College, Sirsa, Haryana, (India) ABSTRACT Web searching has become a daily behavior

More information

UNIT 2 TOPICS IN COMPUTER SCIENCE. Exploring Computer Science 2

UNIT 2 TOPICS IN COMPUTER SCIENCE. Exploring Computer Science 2 UNIT 2 TOPICS IN COMPUTER SCIENCE Exploring Computer Science 2 ACM - ASSOCIATION FOR COMPUTING MACHINERY The Association for Computing Machinery (ACM) is a U.S.-based international learned society for

More information

DATA MINING II - 1DL460. Spring 2014"

DATA MINING II - 1DL460. Spring 2014 DATA MINING II - 1DL460 Spring 2014" A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt14 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,

More information

Information Retrieval. (M&S Ch 15)

Information Retrieval. (M&S Ch 15) Information Retrieval (M&S Ch 15) 1 Retrieval Models A retrieval model specifies the details of: Document representation Query representation Retrieval function Determines a notion of relevance. Notion

More information

Information Retrieval

Information Retrieval Information Retrieval Course presentation João Magalhães 1 Relevance vs similarity Multimedia documents Information retrieval application Query Documents Information side User side What is the best [search

More information

doi: / _32

doi: / _32 doi: 10.1007/978-3-319-12823-8_32 Simple Document-by-Document Search Tool Fuwatto Search using Web API Masao Takaku 1 and Yuka Egusa 2 1 University of Tsukuba masao@slis.tsukuba.ac.jp 2 National Institute

More information

Natural Language Processing

Natural Language Processing Natural Language Processing Information Retrieval Potsdam, 14 June 2012 Saeedeh Momtazi Information Systems Group based on the slides of the course book Outline 2 1 Introduction 2 Indexing Block Document

More information

SUMMON WEB-SCALE DISCOVERY. ADA University Baku 02/04/2014

SUMMON WEB-SCALE DISCOVERY. ADA University Baku 02/04/2014 SUMMON WEB-SCALE DISCOVERY ADA University Baku 02/04/2014 Why an Automated Management Solution is Important Academic Library Expenditures on Purchased and Licensed Content 90% 80% 70% 60% 50% 40% 30% 20%

More information

Computer Fundamentals : Pradeep K. Sinha& Priti Sinha

Computer Fundamentals : Pradeep K. Sinha& Priti Sinha Computer Fundamentals Pradeep K. Sinha Priti Sinha Chapter 18 The Internet Slide 1/23 Learning Objectives In this chapter you will learn about: Definition and history of the Internet Its basic services

More information

How Primo Works VE. 1.1 Welcome. Notes: Published by Articulate Storyline Welcome to how Primo works.

How Primo Works VE. 1.1 Welcome. Notes: Published by Articulate Storyline   Welcome to how Primo works. How Primo Works VE 1.1 Welcome Welcome to how Primo works. 1.2 Objectives By the end of this session, you will know - What discovery, delivery, and optimization are - How the library s collections and

More information

There s No Reason Not to Localize State of Localization Benchmark Survey

There s No Reason Not to Localize State of Localization Benchmark Survey There s No Reason Not to Localize State of Localization Benchmark Survey Transifex 2014 Click me TM Localization Benchmark Study 2014 The concept of localization is becoming more important as we all become

More information

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai.

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai. UNIT-V WEB MINING 1 Mining the World-Wide Web 2 What is Web Mining? Discovering useful information from the World-Wide Web and its usage patterns. 3 Web search engines Index-based: search the Web, index

More information

Chat Activity. Moodle: Collaborative Activities & Blocks. Creating Chats

Chat Activity. Moodle: Collaborative Activities & Blocks. Creating Chats Chat Activity The Moodle chat tool allows communication between instructors and students in real time. Unlike in discussion forums, which are a means for asynchronous communication, online chat sessions

More information

ELF: the Electronic Learning Facilitator. R. Kirby, R McAndrew, A. Kilgour, H. Taylor and J. T. Mayes ICBL, Heriot-Watt University, Edinburgh

ELF: the Electronic Learning Facilitator. R. Kirby, R McAndrew, A. Kilgour, H. Taylor and J. T. Mayes ICBL, Heriot-Watt University, Edinburgh ELF: the Electronic Learning Facilitator R. Kirby, R McAndrew, A. Kilgour, H. Taylor and J. T. Mayes ICBL, Heriot-Watt University, Edinburgh As the world-wide computer network becomes ubiquitous, new tools

More information

D2L vs. Canvas Mobile Experience

D2L vs. Canvas Mobile Experience Preface Keegan Long-Wheeler D2L vs. Canvas Mobile Experience This document highlights and compares the mobile experiences of using D2L and Canvas. The mobile experience was evaluated using an ipad on ios

More information

Introduction. What do you know about web in general and web-searching in specific?

Introduction. What do you know about web in general and web-searching in specific? WEB SEARCHING Introduction What do you know about web in general and web-searching in specific? Web World Wide Web (or WWW, It is called a web because the interconnections between documents resemble a

More information

VIDEO SEARCHING AND BROWSING USING VIEWFINDER

VIDEO SEARCHING AND BROWSING USING VIEWFINDER VIDEO SEARCHING AND BROWSING USING VIEWFINDER By Dan E. Albertson Dr. Javed Mostafa John Fieber Ph. D. Student Associate Professor Ph. D. Candidate Information Science Information Science Information Science

More information

Enhancing Cluster Quality by Using User Browsing Time

Enhancing Cluster Quality by Using User Browsing Time Enhancing Cluster Quality by Using User Browsing Time Rehab M. Duwairi* and Khaleifah Al.jada'** * Department of Computer Information Systems, Jordan University of Science and Technology, Irbid 22110,

More information

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm K.Parimala, Assistant Professor, MCA Department, NMS.S.Vellaichamy Nadar College, Madurai, Dr.V.Palanisamy,

More information

5 Choosing keywords Initially choosing keywords Frequent and rare keywords Evaluating the competition rates of search

5 Choosing keywords Initially choosing keywords Frequent and rare keywords Evaluating the competition rates of search Seo tutorial Seo tutorial Introduction to seo... 4 1. General seo information... 5 1.1 History of search engines... 5 1.2 Common search engine principles... 6 2. Internal ranking factors... 8 2.1 Web page

More information

A COMPARATIVE STUDY OF BYG SEARCH ENGINES

A COMPARATIVE STUDY OF BYG SEARCH ENGINES American Journal of Engineering Research (AJER) e-issn: 2320-0847 p-issn : 2320-0936 Volume-2, Issue-4, pp-39-43 www.ajer.us Research Paper Open Access A COMPARATIVE STUDY OF BYG SEARCH ENGINES Kailash

More information

WWW and Web Browser. 6.1 Objectives In this chapter we will learn about:

WWW and Web Browser. 6.1 Objectives In this chapter we will learn about: WWW and Web Browser 6.0 Introduction WWW stands for World Wide Web. WWW is a collection of interlinked hypertext pages on the Internet. Hypertext is text that references some other information that can

More information

Apache Lucene - Scoring

Apache Lucene - Scoring Grant Ingersoll Table of contents 1 Introduction...2 2 Scoring... 2 2.1 Fields and Documents... 2 2.2 Score Boosting...3 2.3 Understanding the Scoring Formula...3 2.4 The Big Picture...3 2.5 Query Classes...

More information

Course Design, Representation and Browser for Web Based Education

Course Design, Representation and Browser for Web Based Education Course Design, Representation and Browser for Web Based Education KUNAL CHAWLA Department of Information Technology Indian Institute of Information Technology Allahabad, Uttar Pradesh INDIA Abstract: -

More information

Enhancing Cluster Quality by Using User Browsing Time

Enhancing Cluster Quality by Using User Browsing Time Enhancing Cluster Quality by Using User Browsing Time Rehab Duwairi Dept. of Computer Information Systems Jordan Univ. of Sc. and Technology Irbid, Jordan rehab@just.edu.jo Khaleifah Al.jada' Dept. of

More information

Information Retrieval

Information Retrieval Introduction Information Retrieval Information retrieval is a field concerned with the structure, analysis, organization, storage, searching and retrieval of information Gerard Salton, 1968 J. Pei: Information

More information

Information Retrieval

Information Retrieval Information Retrieval CSC 375, Fall 2016 An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have

More information

Creating a Classifier for a Focused Web Crawler

Creating a Classifier for a Focused Web Crawler Creating a Classifier for a Focused Web Crawler Nathan Moeller December 16, 2015 1 Abstract With the increasing size of the web, it can be hard to find high quality content with traditional search engines.

More information

Search Engines Information Retrieval in Practice

Search Engines Information Retrieval in Practice Search Engines Information Retrieval in Practice W. BRUCE CROFT University of Massachusetts, Amherst DONALD METZLER Yahoo! Research TREVOR STROHMAN Google Inc. ----- PEARSON Boston Columbus Indianapolis

More information

Discovering Information through Summon:

Discovering Information through Summon: Discovering Information through Summon: An Analysis of User Search Strategies and Search Success Ingrid Hsieh-Yee Professor, Dept. of Library and Information Science, Catholic University of America Shanyun

More information

Administrative. Web crawlers. Web Crawlers and Link Analysis!

Administrative. Web crawlers. Web Crawlers and Link Analysis! Web Crawlers and Link Analysis! David Kauchak cs458 Fall 2011 adapted from: http://www.stanford.edu/class/cs276/handouts/lecture15-linkanalysis.ppt http://webcourse.cs.technion.ac.il/236522/spring2007/ho/wcfiles/tutorial05.ppt

More information

Adaptable and Adaptive Web Information Systems. Lecture 1: Introduction

Adaptable and Adaptive Web Information Systems. Lecture 1: Introduction Adaptable and Adaptive Web Information Systems School of Computer Science and Information Systems Birkbeck College University of London Lecture 1: Introduction George Magoulas gmagoulas@dcs.bbk.ac.uk October

More information

OUTCOMES BASED LEARNILNG MATRIX

OUTCOMES BASED LEARNILNG MATRIX CTIM287 HTML5 for Websites and Applications OUTCOMES BASED LEARNILNG MATRIX Course: CTIM287 HTML5 for Websites and Applications (3 credits, 45 hours) Department: Computer Technology and Information Management

More information

A Survey on Information Extraction in Web Searches Using Web Services

A Survey on Information Extraction in Web Searches Using Web Services A Survey on Information Extraction in Web Searches Using Web Services Maind Neelam R., Sunita Nandgave Department of Computer Engineering, G.H.Raisoni College of Engineering and Management, wagholi, India

More information

LIST OF ACRONYMS & ABBREVIATIONS

LIST OF ACRONYMS & ABBREVIATIONS LIST OF ACRONYMS & ABBREVIATIONS ARPA CBFSE CBR CS CSE FiPRA GUI HITS HTML HTTP HyPRA NoRPRA ODP PR RBSE RS SE TF-IDF UI URI URL W3 W3C WePRA WP WWW Alpha Page Rank Algorithm Context based Focused Search

More information

by D2L CONTENT DISCUSSIONS STUDENT CHANGES August, 2015 New Name, Same System!

by D2L CONTENT DISCUSSIONS STUDENT CHANGES August, 2015 New Name, Same System! by D2L August, 2015 = New Name, Same System! CONTENT Update Description Visual Reference Add video and audio directly to the Content module. Drag-n-drop content now lets you choose where to store the background

More information

A User Study on Features Supporting Subjective Relevance for Information Retrieval Interfaces

A User Study on Features Supporting Subjective Relevance for Information Retrieval Interfaces A user study on features supporting subjective relevance for information retrieval interfaces Lee, S.S., Theng, Y.L, Goh, H.L.D., & Foo, S. (2006). Proc. 9th International Conference of Asian Digital Libraries

More information

Computer Science 572 Midterm Prof. Horowitz Thursday, March 8, 2012, 2:00pm 3:00pm

Computer Science 572 Midterm Prof. Horowitz Thursday, March 8, 2012, 2:00pm 3:00pm Computer Science 572 Midterm Prof. Horowitz Thursday, March 8, 2012, 2:00pm 3:00pm Name: Student Id Number: 1. This is a closed book exam. 2. Please answer all questions. 3. There are a total of 40 questions.

More information

Chapter 126 TEKS for Technology Applications

Chapter 126 TEKS for Technology Applications Chapter 126 TEKS for Technology Applications What is Chapter 126? Chapter 126 outlines the skill levels that have to be achieved by all Texas students per year through the 8 th grade. Since many schools

More information

CS54701: Information Retrieval

CS54701: Information Retrieval CS54701: Information Retrieval Basic Concepts 19 January 2016 Prof. Chris Clifton 1 Text Representation: Process of Indexing Remove Stopword, Stemming, Phrase Extraction etc Document Parser Extract useful

More information

SIMSme Management Cockpit Documentation

SIMSme Management Cockpit Documentation Introduction SIMSme Management Cockpit Documentation Version 2.1 February 2018 Table of Contents 1 INTRODUCTION... 2 2 USERS... 3 3 LICENSES... 5 4 GROUPS... 7 5 CHANNELS... 8 6 DASHBOARD...10 7 APP SETTINGS...12

More information

A New Measure of the Cluster Hypothesis

A New Measure of the Cluster Hypothesis A New Measure of the Cluster Hypothesis Mark D. Smucker 1 and James Allan 2 1 Department of Management Sciences University of Waterloo 2 Center for Intelligent Information Retrieval Department of Computer

More information

National Training and Education Resource. Authoring Course. Participant Guide

National Training and Education Resource. Authoring Course. Participant Guide National Training and Education Resource Authoring Course Participant Guide Table of Contents: OBJECTIVES... 4 OVERVIEW OF NTER... 5 System Requirements... 5 NTER Capabilities... 6 What is the SCORM PlayerWhat

More information

ANDROID APPLICATION DEVELOPMENT COURSE Training Program

ANDROID APPLICATION DEVELOPMENT COURSE Training Program ANDROID APPLICATION DEVELOPMENT COURSE Training Program This Android Application Development Course is a 24 Hours Training Program with Certification. The program is to be developed for Matriculated, Intermediate

More information

CSCI 201L Syllabus Principles of Software Development Spring 2018

CSCI 201L Syllabus Principles of Software Development Spring 2018 L Syllabus Principles of Software Development Spring 2018 Instructor: Jeffrey Miller, Ph.D. Email: jeffrey.miller@usc.edu Web Page: http://www-scf.usc.edu/~csci201 Office: SAL 342 Phone: 213-740-7129 Lectures:

More information

Content Based Smart Crawler For Efficiently Harvesting Deep Web Interface

Content Based Smart Crawler For Efficiently Harvesting Deep Web Interface Content Based Smart Crawler For Efficiently Harvesting Deep Web Interface Prof. T.P.Aher(ME), Ms.Rupal R.Boob, Ms.Saburi V.Dhole, Ms.Dipika B.Avhad, Ms.Suvarna S.Burkul 1 Assistant Professor, Computer

More information

CE 4.1 (MP) to Vista Enterprise Migration Guide for Designers and Instructors

CE 4.1 (MP) to Vista Enterprise Migration Guide for Designers and Instructors Blackboard Learning System CE 4.1 (MP) to Vista Enterprise Migration Guide for Designers and Instructors Application Pack 1 for Blackboard Learning System Vista Enterprise License (Release 4), Service

More information

SEARCH TECHNIQUES: BASIC AND ADVANCED

SEARCH TECHNIQUES: BASIC AND ADVANCED 17 SEARCH TECHNIQUES: BASIC AND ADVANCED 17.1 INTRODUCTION Searching is the activity of looking thoroughly in order to find something. In library and information science, searching refers to looking through

More information

Brightspace by D2L Upgrade Training

Brightspace by D2L Upgrade Training Brightspace by D2L Upgrade Training August 2015 This document provides an overview of some new features, and improvements to existing features implemented in the latest upgrade. GFC MSU is now participating

More information

Lecture #3: PageRank Algorithm The Mathematics of Google Search

Lecture #3: PageRank Algorithm The Mathematics of Google Search Lecture #3: PageRank Algorithm The Mathematics of Google Search We live in a computer era. Internet is part of our everyday lives and information is only a click away. Just open your favorite search engine,

More information