A. Krishna Mohan *1, Harika Yelisala #1, MHM Krishna Prasad #2

Size: px
Start display at page:

Download "A. Krishna Mohan *1, Harika Yelisala #1, MHM Krishna Prasad #2"

Transcription

1 Vol. 2, Issue 3, May-Jun 212, pp IR Tree An Adept Index for Handling Geographic Document ing A. Krishna Mohan *1, Harika Yelisala #1, MHM Krishna Prasad #2 *1,#2 Associate professor, Dept. of Computer Science Jawaharlal Nehru Technological University Kakinada, Andhra Pradesh. #1 Mtech, Department of Computer Science, JNTU Kakinada, Andhra Pradesh. Abstract In processing a location based query, that is propel to a geographic search engine, it retrieves documents that are more textual and spatially relevant. After retrieval search engine ranks the documents based on their combined relevance of textual and spatial aspects to query. Here in this case, both textual and spatial information should be indexed. Existing systems cannot handle both textual and spatial aspects simultaneously, so an adept index is needed that can handle textual and spatial aspects simultaneously. In this paper, we propose an efficient index called IR Tree which indexes textual and spatial information in an integrated way. IR Tree along with Top K document retrieval algorithm performs spatial and textual filtering, relevance computation, and also document ranking by considering the document length i.e., relevance is computed using document length. Keywords- Geographic document searching, indexing, IR Tree, Top- K document Retrieval. 1. INTRODUCTION Internet is the best option to gather information these days. Any information is stored in form of documents, so storage of large number of documents is required. In order to access few documents search engine has to check all the documents one by one. So to avoid that situation an efficient search engine is needed. A search engine is said to be efficient if it retrieves documents that are highly relevant and retrieved in a short latency. To do that an adept index is needed. Index is a data structure that improves the speed of data retrieval operations on a database table. It helps in efficient retrieval and access of required documents. Location based queries are the queries which are related to some location. Location based query are used in many ways like finding weather in particular location, knowing tourist spots. In some cases interviewer wants to check the capability of the applicant. At times the applicant is given some unfamiliar address as the interview venue. So the applicant searches the location along with the name of the company. For example let the venue be near Rose Burg hotel, Oregon. Now the applicant can search the web to get the information. Some of the existing searching engines retrieve the information that are related to Roseburg and some documents that are related to hotels, some related to Oregon place. So to get the correct information search engine must consider the entire keywords Roseburg hotel Oregon place into consideration. Let the company name be ABC. So search engine must help the applicant to get the correct and relevant information. Example1. Let there be 6 documents or web pages in the web server. W={W 1, W 2,. W 6 }. These are the locations that are closely located to Oregon. Each web page consists of some textual words too. The frequencies of three words are shown in the fig 1. Now the problem is to find the documents that are most relevant. A document is said to be concerned if at least one keyword is matched and along with that it should overlap with the given query location. If more textual words are matches and if the document location overlaps or is merged in the query spatial scope then it is said to be more relevant. In all the documents w 1 is more related to given query as the frequency of each word is more matched. So W 1 is first retrieved and then w 5 (if the documents to be retrieved is 2) If the documents are more in number then an efficient index is needed. In this paper, we propose spatial relevance, Textual relevance, ranking in an adept way. Figure 1: Representation of document spatial distribution and frequency of each query word in document set 2. RELATED WORK There are two types of existing systems that can handle location based queries. They are: 2.1. Individual index structures These are the indexing structures where two indexing structures are needed such that one for spatial and one for textual.[1], [2]. Query processing is done in such a way that textual documents are filtered using textual index and now the textually filtered spatial documents are filtered using spatial index. Now the relevance of the intermediate results is computed based on the joint textual and spatial aspects. The main disadvantage of this kind of indexing structure are intermediate results obtained in between textual and spatial filtering may contain very large sets of documents that may not lie under the required spatial scope. Processing time and memory to store those intermediate results is not that well advised. So hybrid index structures came into existence P a g e

2 Vol. 2, Issue 3, May-Jun 212, pp Building two indexing structures and implementing relevance computation is also expensive Hybrid Index Structures: These are the indexing structures where spatial and textual aspects are considered simultaneously. Indexing is built one over another. Here search is done for textual aspects based on the spatial aspects or vice versa. In some hybrid indexes the location index and textual index are combined to a new word that forms a new index [1], [2]. And these kinds cannot be used by many spatial based applications. And also memory for newly created index words is also needed. Some of the hybrid indexes like KR-* use R* index inverted file structure by combining each node of R tree with the list of all the textual keywords present in the object of that sub tree. 3. PROBLEM DEFINITION Documents are nothing but web pages. Let web pages be denoted as W= {W 1, W 2,., W n } which consists of say n pages. Each page consists of set of textual keywords T w and set of spatial locations S w. Location based query is the query which specifies set of textual keywords Tq and spatial scope Sq. In this paper query is represented as Q (Tq, Sq) and web page as W (T w, S w ). General Textual Relevance computation: A web page w is said to be textually relevant to query Q(tq, sq) if all or some of the textual keywords of the query are present in web page. Spatial Relevance computation: A location in the document w is said to spatially related to query location Sq only if document location S w overlaps either completely or partial with the query location S q. Geographic Web page searching: ing is finding the documents that are both textually and spatially related to a given query. In technical way, Geographic document search engine identifies the web pages in web pages set W that are both textually and spatially relevant to the query Q (Tq, Sq). Geographic Webpage Ranking: Ranking the geographic documents based on some order is called Geographic Webpage Ranking. User is interested in only top K web pages. All the documents are retrieved such that descending relevance order is taken, and retrieved accordingly. As Textual and spatial are two different aspects and have their own importance, for relevance computation they are assigned some weights. It is represented as: R(W) = δ* T(W)+ (1-δ)* S(W) iff T(W)> and S(W) > = otherwise Relevance of a word is estimated using the weighted sum of textual and spatial relevance where δ is a parameter to manage the relative weights. Document length: Documents vary by their length. There may be two documents that are relevant to the given query. When compared one document may be very small and other may be very large. On computing the relevance the both may be interesting ones but the large one may have more information than smaller one, even then the smaller document may be more relevant. So document length must be considered in computing the relevance. 4. IR TREE IR tree is a tree data structure which is used as an index to handle location based queries. IR tree is designed such that it performs spatial clustering first and then textual filtering. Here first spatial filtering is done so that search space can be abridged because there may be many documents that are textually related but only very few of those are bounded within spatial scope. Now textual filtering is done so as to reduce search cost. Finally, the joint relevance and ranking is done simultaneously such that, as soon as top k (the number of documents to be retrieved) documents are obtained the search process stops. Coming to the design issue, index structure must be designed in proper way as each textual word in documents is treated as a dimension. Document space need to cover many very high dimensional spaces. In addition to that spatial locations and textual words have their own representations and measurements. So index must integrate these two aspects so that they must be compatible. Our IR Tree is designed to perform spatial filtering, textual filtering, relevance computation, and ranking simultaneously. Even storage and access overheads are considered IR Tree Strucutre IR tree is designed in such a way that it clusters spatial documents and abstracts textual documents under various granularities [1]. All the spatially related documents are clustered so that any document that does not belong to that region requested by the user, can be pruned as and then as unrelated. All textual words are represented using inverted files. Each node has document précis such that if the query keyword is present in that node then it can traverse according to the nodes pointing it. IR tree is a collection of nodes. It consists of a root node, few non leaf nodes, and few leaf nodes Leaf nodes Each leaf node is linked to an inverted file. All the inverted files consist of list of words, such that each word is pointed to list of documents that contain the particular word. It can be represented as shown in fig 2. Figure 2: Leaf node representation Non leaf nodes 1434 P a g e

3 Vol. 2, Issue 3, May-Jun 212, pp All the non leaf nodes consist of document précis. Document précis is nothing but collection of information regarding node s spatial region, number of documents that come under that particular node. It even contains the WF and IWF. It is shown in fig 3. In brief, let the non leaf node be node i, then Figure 3: Non leaf node representation. assuming node i will have many children nodes to node i. Document précis contains 1. M i : It is the Minimal Bounding Box that covers all the locations of the documents under node i. It is nothing but a small rectangular region that covers all the locations in the document set under the node i. 2. W i : It is the cardinality of the documents that come under the node i. i.e., the number of documents that come under node i. 3. WF and IWF pair values: WF t,w is the Word Frequency i.e.; it is the measure of frequency of a word t that occurs in a document w. IWF t,w is the Inverse Web page Frequency [6], the number of documents in the document set W that contain one or more occurrences of textual word t. This pair helps in computing the relevance just by checking the WF and IWF values as the node need not be considered if the pair value is low IR Tree operations Major operations performed on any Tree structures are insertion, deletion, traversal, and update. First tree has to be constructed, for that set of documents is needed. As mentioned earlier, IR tree first spatially clusters all the documents. i.e., all the spatially related locations are grouped into a set. IR tree construction involves a bottom up fashion. For example, as shown in fig 4, all the documents that are related to Los Angeles are grouped and the documents related to Oakley are grouped and those are grouped under California. Generalization is done in IR tree construction. Figure 4: Spatial Clustering Representation. Here we assume that every document is associated with one location. Algorithm for IR tree construction: Input: A document set, W; Minimal node Fan out, Min; Maximal node Fan out, Max; Output: Root of IR tree. Method: 1: N i 2: for each w ε W do 3: { 4: geocode w and represent S w with MBB M w ; 5: if i ε N i, M i =M w then 6: update w to i s document set W i ; 7: else 8: create new entry i; 9: set M i M w and also W i {w}; 1: N i N i {i}; 11: } 12: for each i ε N i do 13: { 14: construct inverted list for each word in the document of N i ; 15: } 16: while N i > max do 17: { 18: cluster N i based on the min and max into nodes represented as new entries N`I; 19: Prepare document précis for i in N`I; 2: N i N`I; 21: } 22: create root node to layer N i and its document précis; 23: return root node as output; In brief, (Line 1) first a root node is initialized to null and then (Line 2 11)every webpage in the webpage set is geocoded and all the locations of the web pages are represented using Maximum Bounding Boxes. At the same time, clustering is done by the spatially relevant ones. If the document location matches with the existing node set then it can be grouped, otherwise we create another node set. (Line 12-15) construct the inverted list for each word in that node. After that arrange nodes as per max and minimum fan outs and then root is created and returned as output. Insertion, deletion involves updating of node sets, document précis and these insertion and deletion are similar to R tree operations Query Processing In order to process the query, WF and IWF values are to be calculated. IWF calculation is explained in algorithm 2. Finally Top K document retrieval is explained in algorithm 3. Algorithm 2: Input: Root of the tree R, Query (T q, S q ) Output: IWF values for each word in the query, Set of nodes N; Method: 1: set W s =; WF t = for each t in T q ; 2: push R to an empty stack K; 3: while K is not empty do 4: { 1435 P a g e

4 Vol. 2, Issue 3, May-Jun 212, pp : pop an entry i from K; 6: if M i S q then 7: if M i S q then 8: W s W s + W i ; 9: if t Tq > then 1: WF t WF t + WF t,w, t T q ; 11: B B {i}; 12: end if 13: else 14: push all child entries of i to K; 15: end if; 16: end if; 17: end while; 18: Return IWF t,w,sq = ln(1+ Ws /WF t ); This algorithtm calculates IWF value for each Query keyword. First all the nodes are taken into an empty stack and then if the spatial location of query and document location overlaps then the documents under the node are counted. Along with that WF values are also updated and finally the nodes which are mapped with query scope and processed are taken into buffer B. Algorithm 3: Top K document retrieval Buffer B is passed to the Top K document retrieval algorithm so that all the nodes that are related to query are obtained. The main motive is that k documents that mostly spatially and textually related are to be retrieved. Once k documents are retrieved the process must halt. Algorithm 3: Input: interest set B,<WF,IWF> pair, Q ( T q, S q ), δ), k value; Output: K documents retrieved C Method: 1: # define R(i) = δ* t Tq WF IWF WL + (1-δ)/dist(M i, S q ); // consideration of document length 2: for i B do 3: enqueue (i, R(i)) to Q; //all the entries in B are queued.// end for 4: while Q is not empty do 5: dequeue an entry j from Q; 6: if j is a document then 7: C C {j}; 8: if C = k then 9: goto 16; //end if 1: else if j is a leaf entry then 11: for each document w in j s inverted list, S w, t T q do 12: enqueue (w, R (w)) to Q; //end for 13: else 14: for each child g to j do 15: enqueue (g, R (g)) to Q; // end for, if, while 16: output C. 5. EXPERIMENTAL RESULTS These are the results that are examined based on two real data sets namely LATimes 94 and Factiva. The data set LATimes 94 consists of 11,273 documents which includes 2,119 locations. Its average number of words per a document is 54 and number of indexed document words are 9,986. The total size is 421 MB. The other data set Factiva consists of 38,76 documents which includes 47 locations. Its average number of words per documents is 522 and number of indexed document words are 13,286. The total size is 256 MB. First, all the location names are extracted from every single document and these locations are geocoded into Maximum Bounding Boxes (MBB). Geocode is done based on the ontology that covers over 1, 29,784 worldwide locations. Considering some factors like area size, population size...etc of locations they are divided as small city, medium city, large city, state, country these MBB are constructed [3]. Experimental results are generated based on the search time by considering different parameters. Here search time is compared with KR * and Hybrid R. KR * tree is the index structure that combines the textual words with spatial objects in non leaf nodes such that it supports both textual and spatial filtering simultaneously and then it ranks based on joint relevance. Hybrid R is index structure which is implemented by filtering spatial documents first and then textual relevant ones as R tree is placed on the top of the inverted files. ing Efficiency: Efficiency is estimated based on some factors which have their own impact on searching [4], [5]. There are three most important factors to be considered. They are: 1. Size of the query spatial scope S ; 2. Number of requested documents k; 3. Relative significance of textual relevance to spatial relevance. By keeping two parameters constant and varying one parameter the search time will be varied. So let us see how the search time varied for the above mentioned data sets Impact of S : Query spatial scope size is one of the important factors that change the performance of the searching. The values that suit the query spatial scope (in km) are: 1 2, 2 2, 1 2, 5 2. As the spatial locations are represented as Maximum Bounding Boxes they are measured in the form of square area. So in our IR implementation the scope is limited from 1*1 km 2 to 5*5 km 2. First by keeping k value, i.e., number of documents value and relative textual relevance δ constant, say k=1 i.e., retrieval of top 1 documents and δ value to.5 and varying the query spatial scope size, the results are 1436 P a g e

5 Vol. 2, Issue 3, May-Jun 212, pp obtained Figure 8: Effect of K on data set LAtimes Figure 5: Effect of s on the data set Factiva. as per the range of the spatial scope. For the dataset Factiva and LAtimes 94 the results are plotted as shown in fig 5 and fig 6 respectively. Figure 6: Effect of S on the data set LAtimes Impact of K: The number of documents to be retrieved, K is one important factor which is the main estimate of time. The k value can be 1, 3, 5, 1, 3..etc. By fixing S =1*1 km 2 and δ to.5 and varying k and applying for the taken datasets leads to the shown variations in the fig 7 and fig 8. In both cases IR tree performs well when compared to KR * and Hybrid r. Of all the index structures, IR performs best and Hybrid r performs very badly. 5 Figure 7: Effect of K on data set Factiva. 5 Factiva S LAtimes' S K IR TREE tree hybrid IR Tree Hybridr Whatever may be the k value, IR Tree retrieved the documents in very efficient time LA'times94 5 Factiva K IR Tree IR TREE Hybridr Impact of δ: The third factor to be considered is the relative significance of textual and spatial relevance. Hybrid and KR * tree retrieves the documents and then it ranks the documents based on joint significance. Whereas IR tree simultaneously computes the relevance and ranks the document. But, as δ increases the search time also increases. Now by keeping δ value varying and fixing K and S values to 1 and 1*1 km 2 we got the results as plotted in fig 9 and fig 1 for the data sets taken. When compared to Hybrid r and KR *, IR tree performs very well because of it storage structure and its top k document retrieval algorithm. A lot of time is saved due to incremental top k search algorithm, so there is no need to rank the documents after retrieval. 5 LAtimes' δ Figure 9: Effect of δ on data set LAtimes Time( ms) Factiva δ IR TREE Figure 1: Effect of δ on data set Factive. IR tree Hybridr Even though the storage is a bit more due to presence of WF and IWF in internal nodes, but that makes search efficient as it need not enter the nodes that does not satisfy the location or the word. 6. CONCLUSION In this paper, we proposed an efficient index structure namely IR Tree, that handles textual filtering, spatial filtering at a stretch. We mainly focused on consideration of document length in relevance computation. All the top k documents are retrieved in incremental approach. The experiment proved that IR Tree is an adept index and performed sound. Future work can be enhanced to perform semantic similarity between query keywords P a g e

6 Vol. 2, Issue 3, May-Jun 212, pp REFERENCES [1]. Zhisheng Li, Ken C.K Lee, Baihua Zheng, Wang-Chien, Dik Lun Lee, Xufa Wang, IR Harika Yelisala received her BTech from the Department of Tree: An Efficient Index for Geographic Computer Science, Acharya Nagarjuna university in the year Document IEEE Vol 23, No 4, April 29. She is pursuing her Mtech in Jawaharlal Nehru 211. Technological University, Kakinada campus during 21- [2]. Khodaei A, Cyrus Shahabi, Chen Li, SKIF-P: 212. Her research interests includes Cloud Computing, Data A point based Indexing and Ranking of Web Mining. Documents for Spatial Keyword Geoinformatica. Krishna Mohan A is currently working as an Associate [3]. E. Amitay, N.Har El, R. Sivan, A. Soffer, Professor under the department of Computer Science in Web-A-Where: Geotagging Web Content, Jawaharlal Nehru Technological University, Kakinada. His Proc. ACM Sigir 4, pp research includes Data Mining. [4]. I.D. Felipe, V. Hristidis, N.Rishe, Keyword on Spatial Databases, Proc IEEE 24 th Krishna Prasad MHM is currently working as an Associate Int l conference Data Engg(ICDE 8), pp 656- Professor under the department of Computer Science in 665, 28. Jawaharlal Nehru Technological University, vijayanagaram. His research includes Data Mining. [5]. Khodaei A, Shahabi C, Li C(21), Hybird Indexing and Seamless Ranking of Spatial and Textual Features of Web Documents in DEXA, PP [6]. D. Hiemstra, A Probabilistic Justification for using TF IDF Term Weighting in Information Retrieval, Int l j.digital Libraries, Vol 3, No 2, pp , P a g e

ISSN: (Online) Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies

ISSN: (Online) Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies ISSN: 2321-7782 (Online) Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Paper / Case Study Available online at: www.ijarcsms.com

More information

Efficient Index Based Query Keyword Search in the Spatial Database

Efficient Index Based Query Keyword Search in the Spatial Database Advances in Computational Sciences and Technology ISSN 0973-6107 Volume 10, Number 5 (2017) pp. 1517-1529 Research India Publications http://www.ripublication.com Efficient Index Based Query Keyword Search

More information

Spatial Index Keyword Search in Multi- Dimensional Database

Spatial Index Keyword Search in Multi- Dimensional Database Spatial Index Keyword Search in Multi- Dimensional Database Sushma Ahirrao M. E Student, Department of Computer Engineering, GHRIEM, Jalgaon, India ABSTRACT: Nearest neighbor search in multimedia databases

More information

INFREQUENT WEIGHTED ITEM SET MINING USING NODE SET BASED ALGORITHM

INFREQUENT WEIGHTED ITEM SET MINING USING NODE SET BASED ALGORITHM INFREQUENT WEIGHTED ITEM SET MINING USING NODE SET BASED ALGORITHM G.Amlu #1 S.Chandralekha #2 and PraveenKumar *1 # B.Tech, Information Technology, Anand Institute of Higher Technology, Chennai, India

More information

Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index

Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index B. SATYA MOUNIKA 1, J. VENKATA KRISHNA 2 1 M-Tech Dept. of CSE SreeVahini Institute of Science and Technology TiruvuruAndhra

More information

ISSN: (Online) Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies

ISSN: (Online) Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies ISSN: 2321-7782 (Online) Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online

More information

Closest Keywords Search on Spatial Databases

Closest Keywords Search on Spatial Databases Closest Keywords Search on Spatial Databases 1 A. YOJANA, 2 Dr. A. SHARADA 1 M. Tech Student, Department of CSE, G.Narayanamma Institute of Technology & Science, Telangana, India. 2 Associate Professor,

More information

Mining Data Streams. Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction. Summarization Methods. Clustering Data Streams

Mining Data Streams. Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction. Summarization Methods. Clustering Data Streams Mining Data Streams Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction Summarization Methods Clustering Data Streams Data Stream Classification Temporal Models CMPT 843, SFU, Martin Ester, 1-06

More information

Ranking Spatial Data by Quality Preferences

Ranking Spatial Data by Quality Preferences International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 7 (July 2014), PP.68-75 Ranking Spatial Data by Quality Preferences Satyanarayana

More information

Chapter 6: Information Retrieval and Web Search. An introduction

Chapter 6: Information Retrieval and Web Search. An introduction Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods

More information

AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE

AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE Vandit Agarwal 1, Mandhani Kushal 2 and Preetham Kumar 3

More information

Hybrid Indexing and Seamless Ranking of Spatial and Textual Features of Web Documents

Hybrid Indexing and Seamless Ranking of Spatial and Textual Features of Web Documents Hybrid Indexing and Seamless Ranking of Spatial and Textual Features of Web Documents Ali Khodaei 1, Cyrus Shahabi 1, and Chen Li 2 1 Department of Computer Science, University of Southern California,

More information

MAINTAIN TOP-K RESULTS USING SIMILARITY CLUSTERING IN RELATIONAL DATABASE

MAINTAIN TOP-K RESULTS USING SIMILARITY CLUSTERING IN RELATIONAL DATABASE INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 MAINTAIN TOP-K RESULTS USING SIMILARITY CLUSTERING IN RELATIONAL DATABASE Syamily K.R 1, Belfin R.V 2 1 PG student,

More information

Secure and Advanced Best Keyword Cover Search over Spatial Database

Secure and Advanced Best Keyword Cover Search over Spatial Database Secure and Advanced Best Keyword Cover Search over Spatial Database Sweety Thakare 1, Pritam Patil 2, Tarade Priyanka 3, Sonawane Prajakta 4, Prof. Pathak K.R. 4 B. E Student, Dept. of Computer Engineering,

More information

An Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees

An Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees An Implementation of Tree Pattern Matching Algorithms for Enhancement of Query Processing Operations in Large XML Trees N. Murugesan 1 and R.Santhosh 2 1 PG Scholar, 2 Assistant Professor, Department of

More information

TRIE BASED METHODS FOR STRING SIMILARTIY JOINS

TRIE BASED METHODS FOR STRING SIMILARTIY JOINS TRIE BASED METHODS FOR STRING SIMILARTIY JOINS Venkat Charan Varma Buddharaju #10498995 Department of Computer and Information Science University of MIssissippi ENGR-654 INFORMATION SYSTEM PRINCIPLES RESEARCH

More information

Batch processing of Top-k Spatial-textual Queries

Batch processing of Top-k Spatial-textual Queries processing of Top-k Spatial-textual Queries Farhana M. Choudhury J. Shane Culpepper Timos Sellis School of CSIT, RMIT University, Melbourne, Australia {farhana.choudhury,shane.culpepper,timos.sellis}@rmit.edu.au

More information

SKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search

SKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search DOI 10.1007/s10707-011-0142-7 SKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search Ali Khodaei Cyrus Shahabi Chen Li Received: 18 January 2011 / Revised: 19 September

More information

Top-K Ranking Spatial Queries over Filtering Data

Top-K Ranking Spatial Queries over Filtering Data Top-K Ranking Spatial Queries over Filtering Data 1 Lakkapragada Prasanth, 2 Krishna Chaitanya, 1 Student, 2 Assistant Professor, NRL Instiute of Technology,Vijayawada Abstract: A spatial preference query

More information

Data Structure. IBPS SO (IT- Officer) Exam 2017

Data Structure. IBPS SO (IT- Officer) Exam 2017 Data Structure IBPS SO (IT- Officer) Exam 2017 Data Structure: In computer science, a data structure is a way of storing and organizing data in a computer s memory so that it can be used efficiently. Data

More information

A New Technique to Optimize User s Browsing Session using Data Mining

A New Technique to Optimize User s Browsing Session using Data Mining Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,

More information

Information Retrieval. Wesley Mathew

Information Retrieval. Wesley Mathew Information Retrieval Wesley Mathew 30-11-2012 Introduction and motivation Indexing methods B-Tree and the B+ Tree R-Tree IR- Tree Location-aware top-k text query 2 An increasing amount of trajectory data

More information

Enhanced Methodology for supporting approximate string search in Geospatial data

Enhanced Methodology for supporting approximate string search in Geospatial data International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Enhanced Methodology for supporting approximate string search in Geospatial data Ashwina.R 1, Mrs.T.Megala 2 1, 2 (MCA-III year,

More information

Text Data Pre-processing and Dimensionality Reduction Techniques for Document Clustering

Text Data Pre-processing and Dimensionality Reduction Techniques for Document Clustering Text Data Pre-processing and Dimensionality Reduction Techniques for Document Clustering A. Anil Kumar Dept of CSE Sri Sivani College of Engineering Srikakulam, India S.Chandrasekhar Dept of CSE Sri Sivani

More information

Multidimensional Indexing The R Tree

Multidimensional Indexing The R Tree Multidimensional Indexing The R Tree Module 7, Lecture 1 Database Management Systems, R. Ramakrishnan 1 Single-Dimensional Indexes B+ trees are fundamentally single-dimensional indexes. When we create

More information

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

An Overview of various methodologies used in Data set Preparation for Data mining Analysis An Overview of various methodologies used in Data set Preparation for Data mining Analysis Arun P Kuttappan 1, P Saranya 2 1 M. E Student, Dept. of Computer Science and Engineering, Gnanamani College of

More information

A Survey Of Algorithms Related To Xml Based Pattern Matching

A Survey Of Algorithms Related To Xml Based Pattern Matching A Survey Of Algorithms Related To Xml Based Pattern Matching Dr.R.Sivarama Prasad 1, D.Bujji Babu 2, Sk.Habeeb 3, Sd.Jasmin 4 1 Coordinator,International Business Studies, Acharya Nagarjuna University,Guntur,A.P,India,

More information

International Journal of Research in Computer and Communication Technology, Vol 3, Issue 11, November

International Journal of Research in Computer and Communication Technology, Vol 3, Issue 11, November Classified Average Precision (CAP) To Evaluate The Performance of Inferring User Search Goals 1H.M.Sameera, 2 N.Rajesh Babu 1,2Dept. of CSE, PYDAH College of Engineering, Patavala,Kakinada, E.g.dt,AP,

More information

DS504/CS586: Big Data Analytics Data Management Prof. Yanhua Li

DS504/CS586: Big Data Analytics Data Management Prof. Yanhua Li Welcome to DS504/CS586: Big Data Analytics Data Management Prof. Yanhua Li Time: 6:00pm 8:50pm R Location: KH 116 Fall 2017 First Grading for Reading Assignment Weka v 6 weeks v https://weka.waikato.ac.nz/dataminingwithweka/preview

More information

Efficient Adjacent Neighbor Expansion Search Keyword

Efficient Adjacent Neighbor Expansion Search Keyword International Journal for Modern Trends in Science and Technology Volume: 03, Special Issue No: 01, February 2017 ISSN: 2455-3778 http://www.ijmtst.com Efficient Adjacent Neighbor Expansion Search Keyword

More information

Database Applications (15-415)

Database Applications (15-415) Database Applications (15-415) DBMS Internals- Part V Lecture 13, March 10, 2014 Mohammad Hammoud Today Welcome Back from Spring Break! Today Last Session: DBMS Internals- Part IV Tree-based (i.e., B+

More information

arxiv: v1 [cs.db] 30 May 2012

arxiv: v1 [cs.db] 30 May 2012 : Spatio-Textual Similarity Search Ju Fan Guoliang Li Lizhu Zhou Shanshan Chen Jun Hu Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology

More information

Multiterm Keyword Searching For Key Value Based NoSQL System

Multiterm Keyword Searching For Key Value Based NoSQL System Multiterm Keyword Searching For Key Value Based NoSQL System Pallavi Mahajan 1, Arati Deshpande 2 Department of Computer Engineering, PICT, Pune, Maharashtra, India. Pallavinarkhede88@gmail.com 1, ardeshpande@pict.edu

More information

An Efficient Language Interoperability based Search Engine for Mobile Users 1 Pilli Srivalli, 2 P.S.Sitarama Raju

An Efficient Language Interoperability based Search Engine for Mobile Users 1 Pilli Srivalli, 2 P.S.Sitarama Raju An Efficient Language Interoperability based Search Engine for Mobile Users 1 Pilli Srivalli, 2 P.S.Sitarama Raju 1 Final M.Tech Student, 2 Professor 1,2 Dept of CSE, MVGR college of Engineering,Chintavalasa,AP,India.

More information

Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm

Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm Rekha Jain 1, Sulochana Nathawat 2, Dr. G.N. Purohit 3 1 Department of Computer Science, Banasthali University, Jaipur, Rajasthan ABSTRACT

More information

Ranking Web Pages by Associating Keywords with Locations

Ranking Web Pages by Associating Keywords with Locations Ranking Web Pages by Associating Keywords with Locations Peiquan Jin, Xiaoxiang Zhang, Qingqing Zhang, Sheng Lin, and Lihua Yue University of Science and Technology of China, 230027, Hefei, China jpq@ustc.edu.cn

More information

Best Keyword Cover Search

Best Keyword Cover Search Vennapusa Mahesh Kumar Reddy Dept of CSE, Benaiah Institute of Technology and Science. Best Keyword Cover Search Sudhakar Babu Pendhurthi Assistant Professor, Benaiah Institute of Technology and Science.

More information

TwigList: Make Twig Pattern Matching Fast

TwigList: Make Twig Pattern Matching Fast TwigList: Make Twig Pattern Matching Fast Lu Qin, Jeffrey Xu Yu, and Bolin Ding The Chinese University of Hong Kong, China {lqin,yu,blding}@se.cuhk.edu.hk Abstract. Twig pattern matching problem has been

More information

IALP 2016 Improving the Effectiveness of POI Search by Associated Information Summarization

IALP 2016 Improving the Effectiveness of POI Search by Associated Information Summarization IALP 2016 Improving the Effectiveness of POI Search by Associated Information Summarization Hsiu-Min Chuang, Chia-Hui Chang*, Chung-Ting Cheng Dept. of Computer Science and Information Engineering National

More information

Keywords APSE: Advanced Preferred Search Engine, Google Android Platform, Search Engine, Click-through data, Location and Content Concepts.

Keywords APSE: Advanced Preferred Search Engine, Google Android Platform, Search Engine, Click-through data, Location and Content Concepts. Volume 5, Issue 3, March 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Advanced Preferred

More information

Improving the Performance of Search Engine With Respect To Content Mining Kr.Jansi, L.Radha

Improving the Performance of Search Engine With Respect To Content Mining Kr.Jansi, L.Radha Improving the Performance of Search Engine With Respect To Content Mining Kr.Jansi, L.Radha 1 Asst. Professor, Srm University, Chennai 2 Mtech, Srm University, Chennai Abstract R- Google is a dedicated

More information

Nearest Neighbour Expansion Using Keyword Cover Search

Nearest Neighbour Expansion Using Keyword Cover Search Nearest Neighbour Expansion Using Keyword Cover Search [1] P. Sai Vamsi Aravind MTECH(CSE) Institute of Aeronautical Engineering, Hyderabad [2] P.Anjaiah Assistant Professor Institute of Aeronautical Engineering,

More information

Efficient NKS Queries Search in Multidimensional Dataset through Projection and Multi-Scale Hashing Scheme

Efficient NKS Queries Search in Multidimensional Dataset through Projection and Multi-Scale Hashing Scheme Efficient NKS Queries Search in Multidimensional Dataset through Projection and Multi-Scale Hashing Scheme 1 N.NAVEEN KUMAR, 2 YASMEEN ANJUM 1 Assistant Professor, Department of CSE, School of Information

More information

CS301 - Data Structures Glossary By

CS301 - Data Structures Glossary By CS301 - Data Structures Glossary By Abstract Data Type : A set of data values and associated operations that are precisely specified independent of any particular implementation. Also known as ADT Algorithm

More information

International Journal of Scientific & Engineering Research, Volume 6, Issue 10, October ISSN

International Journal of Scientific & Engineering Research, Volume 6, Issue 10, October ISSN International Journal of Scientific & Engineering Research, Volume 6, Issue 10, October-2015 726 Performance Validation of the Modified K- Means Clustering Algorithm Clusters Data S. Govinda Rao Associate

More information

MULTIDIMENSIONAL INDEXING TREE STRUCTURE FOR SPATIAL DATABASE MANAGEMENT

MULTIDIMENSIONAL INDEXING TREE STRUCTURE FOR SPATIAL DATABASE MANAGEMENT MULTIDIMENSIONAL INDEXING TREE STRUCTURE FOR SPATIAL DATABASE MANAGEMENT Dr. G APPARAO 1*, Mr. A SRINIVAS 2* 1. Professor, Chairman-Board of Studies & Convener-IIIC, Department of Computer Science Engineering,

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

Optimize Space Search Using FCC_STF Algorithm in Fuzzy Co-Clustering through Search Engine

Optimize Space Search Using FCC_STF Algorithm in Fuzzy Co-Clustering through Search Engine Optimize Space Search Using FCC_STF Algorithm in Fuzzy Co-Clustering through Search Engine Monika Rani, Satendra Kumar, Vinod Kumar Yadav Abstract Fuzzy co-clustering can be improved if we handle two main

More information

Introduction to Indexing R-trees. Hong Kong University of Science and Technology

Introduction to Indexing R-trees. Hong Kong University of Science and Technology Introduction to Indexing R-trees Dimitris Papadias Hong Kong University of Science and Technology 1 Introduction to Indexing 1. Assume that you work in a government office, and you maintain the records

More information

A Survey on Information Extraction in Web Searches Using Web Services

A Survey on Information Extraction in Web Searches Using Web Services A Survey on Information Extraction in Web Searches Using Web Services Maind Neelam R., Sunita Nandgave Department of Computer Engineering, G.H.Raisoni College of Engineering and Management, wagholi, India

More information

Maintenance of the Prelarge Trees for Record Deletion

Maintenance of the Prelarge Trees for Record Deletion 12th WSEAS Int. Conf. on APPLIED MATHEMATICS, Cairo, Egypt, December 29-31, 2007 105 Maintenance of the Prelarge Trees for Record Deletion Chun-Wei Lin, Tzung-Pei Hong, and Wen-Hsiang Lu Department of

More information

A Comparison of Data Structures for Dijkstra's Single Source Shortest Path Algorithm. Shane Saunders

A Comparison of Data Structures for Dijkstra's Single Source Shortest Path Algorithm. Shane Saunders A Comparison of Data Structures for Dijkstra's Single Source Shortest Path Algorithm Shane Saunders November 5, 1999 Abstract Dijkstra's algorithm computes the shortest paths between a starting vertex

More information

ISSN: (Online) Volume 2, Issue 4, April 2014 International Journal of Advance Research in Computer Science and Management Studies

ISSN: (Online) Volume 2, Issue 4, April 2014 International Journal of Advance Research in Computer Science and Management Studies ISSN: 2321-7782 (Online) Volume 2, Issue 4, April 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Paper / Case Study Available online at: www.ijarcsms.com

More information

Basic Tokenizing, Indexing, and Implementation of Vector-Space Retrieval

Basic Tokenizing, Indexing, and Implementation of Vector-Space Retrieval Basic Tokenizing, Indexing, and Implementation of Vector-Space Retrieval 1 Naïve Implementation Convert all documents in collection D to tf-idf weighted vectors, d j, for keyword vocabulary V. Convert

More information

Spatially-Aware Information Retrieval on the Internet

Spatially-Aware Information Retrieval on the Internet Spatially-Aware Information Retrieval on the Internet SPIRIT is funded by EU IST Programme Contract Number: Abstract Multi-Attribute Similarity Ranking Deliverable number: D17:5301 Deliverable type: R

More information

Enhancing the Efficiency of Radix Sort by Using Clustering Mechanism

Enhancing the Efficiency of Radix Sort by Using Clustering Mechanism Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,

More information

Data Access on Wireless Broadcast Channels using Keywords

Data Access on Wireless Broadcast Channels using Keywords Data Access on Wireless Broadcast Channels using Keywords Mr. Vijaykumar Mantri 1, Mr. Nagaraju A 2 Dept of IT, Padmasri Dr. B. V. Raju Institute of Technology, Narsapur, Dist.Medak, Andhra Pradesh, India.

More information

HYBRID GEO-TEXTUAL INDEX STRUCTURE FOR SPATIAL RANGE KEYWORD SEARCH

HYBRID GEO-TEXTUAL INDEX STRUCTURE FOR SPATIAL RANGE KEYWORD SEARCH HYBRID GEO-TEXTUAL INDEX STRUCTURE FOR SPATIAL RANGE KEYWORD SEARCH Su Nandar Aung 1 and Myint Mint Sein 2 1 University of Computer Studies, Yangon, Myanmar 2 Research and Development Department, University

More information

Chapter 27 Introduction to Information Retrieval and Web Search

Chapter 27 Introduction to Information Retrieval and Web Search Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval

More information

Postfix (and prefix) notation

Postfix (and prefix) notation Postfix (and prefix) notation Also called reverse Polish reversed form of notation devised by mathematician named Jan Łukasiewicz (so really lü-kä-sha-vech notation) Infix notation is: operand operator

More information

Differentially Private H-Tree

Differentially Private H-Tree GeoPrivacy: 2 nd Workshop on Privacy in Geographic Information Collection and Analysis Differentially Private H-Tree Hien To, Liyue Fan, Cyrus Shahabi Integrated Media System Center University of Southern

More information

A Novel Approach for Restructuring Web Search Results by Feedback Sessions Using Fuzzy clustering

A Novel Approach for Restructuring Web Search Results by Feedback Sessions Using Fuzzy clustering A Novel Approach for Restructuring Web Search Results by Feedback Sessions Using Fuzzy clustering R.Dhivya 1, R.Rajavignesh 2 (M.E CSE), Department of CSE, Arasu Engineering College, kumbakonam 1 Asst.

More information

Query Processing & Optimization

Query Processing & Optimization Query Processing & Optimization 1 Roadmap of This Lecture Overview of query processing Measures of Query Cost Selection Operation Sorting Join Operation Other Operations Evaluation of Expressions Introduction

More information

Survey on Recommendation of Personalized Travel Sequence

Survey on Recommendation of Personalized Travel Sequence Survey on Recommendation of Personalized Travel Sequence Mayuri D. Aswale 1, Dr. S. C. Dharmadhikari 2 ME Student, Department of Information Technology, PICT, Pune, India 1 Head of Department, Department

More information

ACM-ICPC Indonesia National Contest Problem A. The Best Team. Time Limit: 2s

ACM-ICPC Indonesia National Contest Problem A. The Best Team. Time Limit: 2s Problem A The Best Team Time Limit: 2s ACM-ICPC 2010 is drawing near and your university want to select three out of N students to form the best team. The university however, has a limited budget, so they

More information

A NOVEL SECURED BOOLEAN BASED SECRET IMAGE SHARING SCHEME

A NOVEL SECURED BOOLEAN BASED SECRET IMAGE SHARING SCHEME VOL 13, NO 13, JULY 2018 ISSN 1819-6608 2006-2018 Asian Research Publishing Network (ARPN) All rights reserved wwwarpnjournalscom A NOVEL SECURED BOOLEAN BASED SECRET IMAGE SHARING SCHEME Javvaji V K Ratnam

More information

Question Bank Subject: Advanced Data Structures Class: SE Computer

Question Bank Subject: Advanced Data Structures Class: SE Computer Question Bank Subject: Advanced Data Structures Class: SE Computer Question1: Write a non recursive pseudo code for post order traversal of binary tree Answer: Pseudo Code: 1. Push root into Stack_One.

More information

A Parallel Access Method for Spatial Data Using GPU

A Parallel Access Method for Spatial Data Using GPU A Parallel Access Method for Spatial Data Using GPU Byoung-Woo Oh Department of Computer Engineering Kumoh National Institute of Technology Gumi, Korea bwoh@kumoh.ac.kr Abstract Spatial access methods

More information

A Perceptual Model Based on Computational Features for Texture Representation and Retrieval

A Perceptual Model Based on Computational Features for Texture Representation and Retrieval A Perceptual Model Based on Computational Features for Texture Representation and Retrieval 1 K.N.Sindhuri, 2 N.Leelavathy, 3 B.Srinivas 1,2,3 Dept. of CSE, Pragati Engineering College, Surampalem, AP,

More information

DDS Dynamic Search Trees

DDS Dynamic Search Trees DDS Dynamic Search Trees 1 Data structures l A data structure models some abstract object. It implements a number of operations on this object, which usually can be classified into l creation and deletion

More information

Query Evaluation Strategies

Query Evaluation Strategies Introduction to Search Engine Technology Term-at-a-Time and Document-at-a-Time Evaluation Ronny Lempel Yahoo! Research (Many of the following slides are courtesy of Aya Soffer and David Carmel, IBM Haifa

More information

Web Information Retrieval using WordNet

Web Information Retrieval using WordNet Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT

More information

Mining Rare Periodic-Frequent Patterns Using Multiple Minimum Supports

Mining Rare Periodic-Frequent Patterns Using Multiple Minimum Supports Mining Rare Periodic-Frequent Patterns Using Multiple Minimum Supports R. Uday Kiran P. Krishna Reddy Center for Data Engineering International Institute of Information Technology-Hyderabad Hyderabad,

More information

A Survey on Efficient Location Tracker Using Keyword Search

A Survey on Efficient Location Tracker Using Keyword Search A Survey on Efficient Location Tracker Using Keyword Search Prasad Prabhakar Joshi, Anand Bone ME Student, Smt. Kashibai Navale Sinhgad Institute of Technology and Science Kusgaon (Budruk), Lonavala, Pune,

More information

Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach

Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach P.T.Shijili 1 P.G Student, Department of CSE, Dr.Nallini Institute of Engineering & Technology, Dharapuram, Tamilnadu, India

More information

Knowledge discovery from XML Database

Knowledge discovery from XML Database Knowledge discovery from XML Database Pravin P. Chothe 1 Prof. S. V. Patil 2 Prof.S. H. Dinde 3 PG Scholar, ADCET, Professor, ADCET Ashta, Professor, SGI, Atigre, Maharashtra, India Maharashtra, India

More information

Performance Evaluation of Gene based Ontology Using Attribute Selection Methods

Performance Evaluation of Gene based Ontology Using Attribute Selection Methods Performance Evaluation of Gene based Ontology Using Attribute Selection Methods Ch. Uma Shankari #1 T. Sudha Rani *2 # M. Tech. Student Department of Computer Science and Engineering Aditya Engineering

More information

REMOVAL OF REDUNDANT AND IRRELEVANT DATA FROM TRAINING DATASETS USING SPEEDY FEATURE SELECTION METHOD

REMOVAL OF REDUNDANT AND IRRELEVANT DATA FROM TRAINING DATASETS USING SPEEDY FEATURE SELECTION METHOD Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,

More information

A Semantic Model for Concept Based Clustering

A Semantic Model for Concept Based Clustering A Semantic Model for Concept Based Clustering S.Saranya 1, S.Logeswari 2 PG Scholar, Dept. of CSE, Bannari Amman Institute of Technology, Sathyamangalam, Tamilnadu, India 1 Associate Professor, Dept. of

More information

Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey

Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey G. Shivaprasad, N. V. Subbareddy and U. Dinesh Acharya

More information

Navigation Cost Modeling Based On Ontology

Navigation Cost Modeling Based On Ontology IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661 Volume 4, Issue 3 (Sep.-Oct. 2012), PP 34-39 Navigation Cost Modeling Based On Ontology 1 Madala Venkatesh, 2 Dr.R.V.Krishnaiah 1 Department

More information

A Note on Scheduling Parallel Unit Jobs on Hypercubes

A Note on Scheduling Parallel Unit Jobs on Hypercubes A Note on Scheduling Parallel Unit Jobs on Hypercubes Ondřej Zajíček Abstract We study the problem of scheduling independent unit-time parallel jobs on hypercubes. A parallel job has to be scheduled between

More information

MULTI - KEYWORD RANKED SEARCH OVER ENCRYPTED DATA SUPPORTING SYNONYM QUERY

MULTI - KEYWORD RANKED SEARCH OVER ENCRYPTED DATA SUPPORTING SYNONYM QUERY ISSN: 0976-3104 SPECIAL ISSUE Jayanthi and Prabadevi RESEARCH OPEN ACCESS MULTI - KEYWORD RANKED SEARCH OVER ENCRYPTED DATA SUPPORTING SYNONYM QUERY Jayanthi M.* and Prabadevi School of Information Technology

More information

6. Dicretization methods 6.1 The purpose of discretization

6. Dicretization methods 6.1 The purpose of discretization 6. Dicretization methods 6.1 The purpose of discretization Often data are given in the form of continuous values. If their number is huge, model building for such data can be difficult. Moreover, many

More information

Efficient Construction of Safe Regions for Moving knn Queries Over Dynamic Datasets

Efficient Construction of Safe Regions for Moving knn Queries Over Dynamic Datasets Efficient Construction of Safe Regions for Moving knn Queries Over Dynamic Datasets Mahady Hasan, Muhammad Aamir Cheema, Xuemin Lin, Ying Zhang The University of New South Wales, Australia {mahadyh,macheema,lxue,yingz}@cse.unsw.edu.au

More information

ALGORITHM FOR MINING TIME VARYING FREQUENT ITEMSETS

ALGORITHM FOR MINING TIME VARYING FREQUENT ITEMSETS ALGORITHM FOR MINING TIME VARYING FREQUENT ITEMSETS D.SUJATHA 1, PROF.B.L.DEEKSHATULU 2 1 HOD, Department of IT, Aurora s Technological and Research Institute, Hyderabad 2 Visiting Professor, Department

More information

June 15, Abstract. 2. Methodology and Considerations. 1. Introduction

June 15, Abstract. 2. Methodology and Considerations. 1. Introduction Organizing Internet Bookmarks using Latent Semantic Analysis and Intelligent Icons Note: This file is a homework produced by two students for UCR CS235, Spring 06. In order to fully appreacate it, it may

More information

Data Structure Advanced

Data Structure Advanced Data Structure Advanced 1. Is it possible to find a loop in a Linked list? a. Possilbe at O(n) b. Not possible c. Possible at O(n^2) only d. Depends on the position of loop Solution: a. Possible at O(n)

More information

RASIM: a rank-aware separate index method for answering top-k spatial keyword queries

RASIM: a rank-aware separate index method for answering top-k spatial keyword queries World Wide Web (2013) 16:111 139 DOI 10.1007/s11280-012-0159-3 RASIM: a rank-aware separate index method for answering top-k spatial keyword queries Hyuk-Yoon Kwon Kyu-Young Whang Il-Yeol Song Haixun Wang

More information

Concept Tree Based Clustering Visualization with Shaded Similarity Matrices

Concept Tree Based Clustering Visualization with Shaded Similarity Matrices Syracuse University SURFACE School of Information Studies: Faculty Scholarship School of Information Studies (ischool) 12-2002 Concept Tree Based Clustering Visualization with Shaded Similarity Matrices

More information

Rule-Based Method for Entity Resolution Using Optimized Root Discovery (ORD)

Rule-Based Method for Entity Resolution Using Optimized Root Discovery (ORD) American-Eurasian Journal of Scientific Research 12 (5): 255-259, 2017 ISSN 1818-6785 IDOSI Publications, 2017 DOI: 10.5829/idosi.aejsr.2017.255.259 Rule-Based Method for Entity Resolution Using Optimized

More information

End-Term Examination Second Semester [MCA] MAY-JUNE 2006

End-Term Examination Second Semester [MCA] MAY-JUNE 2006 (Please write your Roll No. immediately) Roll No. Paper Code: MCA-102 End-Term Examination Second Semester [MCA] MAY-JUNE 2006 Subject: Data Structure Time: 3 Hours Maximum Marks: 60 Note: Question 1.

More information

Batch processing of Top-k Spatial-textual Queries

Batch processing of Top-k Spatial-textual Queries Batch processing of Top-k Spatial-textual Queries FARHANA M. CHOUDHURY, School of Science, RMIT University, Australia J. SHANE CULPEPPER, School of Science, RMIT University, Australia ZHIFENG BAO, School

More information

Uncertain Data Classification Using Decision Tree Classification Tool With Probability Density Function Modeling Technique

Uncertain Data Classification Using Decision Tree Classification Tool With Probability Density Function Modeling Technique Research Paper Uncertain Data Classification Using Decision Tree Classification Tool With Probability Density Function Modeling Technique C. Sudarsana Reddy 1 S. Aquter Babu 2 Dr. V. Vasu 3 Department

More information

EFFECTIVE EFFICIENT BOOLEAN RETRIEVAL

EFFECTIVE EFFICIENT BOOLEAN RETRIEVAL EFFECTIVE EFFICIENT BOOLEAN RETRIEVAL J Naveen Kumar 1, Dr. M. Janga Reddy 2 1 jnaveenkumar6@gmail.com, 2 pricipalcmrit@gmail.com 1 M.Tech Student, Department of Computer Science, CMR Institute of Technology,

More information

A Novel Method to Estimate the Route and Travel Time with the Help of Location Based Services

A Novel Method to Estimate the Route and Travel Time with the Help of Location Based Services A Novel Method to Estimate the Route and Travel Time with the Help of Location Based Services M.Uday Kumar Associate Professor K.Pradeep Reddy Associate Professor S Navaneetha M.Tech Student Abstract Location-based

More information

Distributed Bottom up Approach for Data Anonymization using MapReduce framework on Cloud

Distributed Bottom up Approach for Data Anonymization using MapReduce framework on Cloud Distributed Bottom up Approach for Data Anonymization using MapReduce framework on Cloud R. H. Jadhav 1 P.E.S college of Engineering, Aurangabad, Maharashtra, India 1 rjadhav377@gmail.com ABSTRACT: Many

More information

Best Keyword Cover Search Using Distance and Rating

Best Keyword Cover Search Using Distance and Rating Best Keyword Cover Search Using Distance and Rating Mr. Vishal D. Kolekar, Prof. Ajay Kumar Gupta Department of Computer Engineering, SPs Institute Of Knowledge College Of Engineering, Savitribai Phule

More information

Nearest Keyword Set Search In Multi- Dimensional Datasets

Nearest Keyword Set Search In Multi- Dimensional Datasets Nearest Keyword Set Search In Multi- Dimensional Datasets 1 R. ANITHA, 2 R. JAYA SUNDARI, 3 V. KANIMOZHI, 4 K. MUMTAJ BEGAM 5 Mr. D.SATHYAMURTHY ME 1,2,3,4 Students, 5 Assistant Professor U.G Scholar MRK

More information

MSRA Columbus at GeoCLEF2007

MSRA Columbus at GeoCLEF2007 MSRA Columbus at GeoCLEF2007 Zhisheng Li 1, Chong Wang 2, Xing Xie 2, Wei-Ying Ma 2 1 Department of Computer Science, University of Sci. & Tech. of China, Hefei, Anhui, 230026, P.R. China zsli@mail.ustc.edu.cn

More information