A NOVEL APPROACH ON SPATIAL OBJECTS FOR OPTIMAL ROUTE SEARCH USING BEST KEYWORD COVER QUERY

Similar documents
Closest Keywords Search on Spatial Databases

Best Keyword Cover Search

Efficient Adjacent Neighbor Expansion Search Keyword

A Survey on Efficient Location Tracker Using Keyword Search

PG Student, Department of MCA, St. Ann s College of Engineering & Technology, Chirala, Andhra Pradesh, India

ISSN: (Online) Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies

Secure and Advanced Best Keyword Cover Search over Spatial Database

Nearest Neighbour Expansion Using Keyword Cover Search

NOVEL CACHE SEARCH TO SEARCH THE KEYWORD COVERS FROM SPATIAL DATABASE

USING KEYWORD-NNE ALGORITHM

Efficient Index Based Query Keyword Search in the Spatial Database

HYBRID GEO-TEXTUAL INDEX STRUCTURE FOR SPATIAL RANGE KEYWORD SEARCH

Best Keyword Cover Search Using Distance and Rating

Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index

Efficient NKS Queries Search in Multidimensional Dataset through Projection and Multi-Scale Hashing Scheme

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

A Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods

Optimal Enhancement of Location Aware Spatial Keyword Cover

Best Keyword Cover Search

Ranking Web Pages by Associating Keywords with Locations

Inverted Index for Fast Nearest Neighbour

Top-k Spatial Keyword Preference Query

Spatial Index Keyword Search in Multi- Dimensional Database

EFFECTIVELY MINIMIZE THE OVERALL TRIP DISTANCE USING CONTINUOUS DETOUR QUERY IN SPATIAL NETWORK

Spatial Keyword Search. Presented by KWOK Chung Hin, WONG Kam Kwai

Top-k Keyword Search Over Graphs Based On Backward Search

Best Keyword Cover Search Using Spatial Data

Improved Framework Of Location Aware Keyword Query Suggestion (LKS) In Geo-Based Social Applications

An Edge-Based Algorithm for Spatial Query Processing in Real-Life Road Networks

Keyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan

Batch processing of Top-k Spatial-textual Queries

A SURVEY ON SEARCHING SPATIO-TEXTUAL TOP K-QUERIES BY REVERSE KEYWORD

A Novel Approach for Minimum Spanning Tree Based Clustering Algorithm

Results and Discussions on Transaction Splitting Technique for Mining Differential Private Frequent Itemsets

Detect tracking behavior among trajectory data

Fast K-nearest neighbors searching algorithms for point clouds data of 3D scanning system 1

User-Contributed Relevance and Nearest Neighbor Queries

DRIVEN by mobile computing, location-based services

Using Center Representation and Variance Effect on K-NN Classification

QUERY REGION DETERMINATION BASED ON REGION IMPORTANCE INDEX AND RELATIVE POSITION FOR REGION-BASED IMAGE RETRIEVAL

Multicore based Spatial k-dominant Skyline Computation

Tag Based Image Search by Social Re-ranking

Information Retrieval. Wesley Mathew

Nearest Keyword Set Search In Multi- Dimensional Datasets

Computing Continuous Skyline Queries without Discriminating between Static and Dynamic Attributes

The Flexible Group Spatial Keyword Query

Improvement of SURF Feature Image Registration Algorithm Based on Cluster Analysis

Dr. K. Velmurugan 1, Miss. N. Meenatchi 2 1 Assistant Professor Department of Computer Science, Govt.Arts and Science College, Uthiramerur

Continuous Monitoring of Top-k Spatial Keyword Queries in Road Networks *

IMPLEMENTATION AND COMPARATIVE STUDY OF IMPROVED APRIORI ALGORITHM FOR ASSOCIATION PATTERN MINING

A Novel Method to Estimate the Route and Travel Time with the Help of Location Based Services

Performance evaluation of searching using various indexing techniques in Lucene with Relational Databases

Correspondence Clustering: An Approach to Cluster Multiple Related Spatial Datasets

Analysis of Extended Performance for clustering of Satellite Images Using Bigdata Platform Spark

Efficient Processing of Top-k Spatial Preference Queries

Social Behavior Prediction Through Reality Mining

Enhancing Spatial Inverted Index Technique for Keyword Searching With High Dimensional Data

Similarity Search: A Matching Based Approach

Distance-based Outlier Detection: Consolidation and Renewed Bearing

ST-HBase: A Scalable Data Management System for Massive Geo-tagged Objects

ERCIM Alain Bensoussan Fellowship Scientific Report

Survey of Spatial Approximate String Search

Smartcrawler: A Two-stage Crawler Novel Approach for Web Crawling

REDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM. Pudukkottai, Tamil Nadu, India

Trajectory Compression under Network Constraints

Efficient processing of top-k spatial Boolean queries for distributed system

Building a low-latency, proximity-aware DHT-based P2P network

Spatiotemporal Access to Moving Objects. Hao LIU, Xu GENG 17/04/2018

Design of student information system based on association algorithm and data mining technology. CaiYan, ChenHua

SPATIAL INVERTED INDEX BY USING FAST NEAREST NEIGHBOR SEARCH

A Survey on Nearest Neighbor Search with Keywords

Indexing Land Surface for Efficient knn Query

Multidirectional 2DPCA Based Face Recognition System

Survey of Efficient and Fast Nearest Neighbor Search For Spatial Query on Multidimensional Data

Subspace Discovery for Promotion: A Cell Clustering Approach

Goal Directed Relative Skyline Queries in Time Dependent Road Networks

Top-K Ranking Spatial Queries over Filtering Data

Querying Shortest Distance on Large Graphs

A Novel Framework to Measure the Degree of Difficulty on Keyword Query Routing

modern database systems lecture 5 : top-k retrieval

k-dominant and Extended k-dominant Skyline Computation by Using Statistics

Contour-Based Large Scale Image Retrieval

Distributed Skyline Processing: a Trend in Database Research Still Going Strong

Friend Recommendation by Using Skyline Query and Location Information

An Enhanced Face Recognition System based on Rotated Two Dimensional Principal Components

APPROXIMATE PATTERN MATCHING WITH RULE BASED AHO-CORASICK INDEX M.Priya 1, Dr.R.Kalpana 2 1

MobiPLACE*: A Distributed Framework for Spatio-Temporal Data Streams Processing Utilizing Mobile Clients Processing Power.

Project Participants

Open Access Research on the Prediction Model of Material Cost Based on Data Mining

K-modes Clustering Algorithm for Categorical Data

Circle of Friend Query in Geo-Social Networks

Closest Keyword Retrieval with Data Mining Approach

Literature Survey on Various Recommendation Techniques in Collaborative Filtering

Study and Analysis of Recommendation Systems for Location Based Social Network (LBSN)

Incremental calculation of isochrones regarding duration

A Reduce Identical Composite Event Transmission Algorithm for Wireless Sensor Networks

Efficient Top-k Spatial Distance Joins

Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme

Spatial Data Ranking through Skyline Queries

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data

Transcription:

A NOVEL APPROACH ON SPATIAL OBJECTS FOR OPTIMAL ROUTE SEARCH USING BEST KEYWORD COVER QUERY S.Shiva Reddy *1 P.Ajay Kumar *2 *12 Lecterur,Dept of CSE JNTUH-CEH Abstract Optimal route search using spatial keyword query focus on keyword searching using best keyword cover query which is a form of spatial keyword query. It operates on spatial objects stored in spatial database and comes with algorithms that can retrieve results very fast. Best keyword cover query aims to find objects associated with keywords. The method proposed considers keyword rating, keyword relevance and spatial relevance. It also helps to retrieve data based on Boolean range query. Keywords Spatial keyword query, spatial objects, spatial database, best keyword cover query. I. INTRODUCTION Data mining is the means of extracting data from a dataset for users to use it in various purposes. The purpose of such data plays a significant role in keyword searching. Searching is a common activity happening in data mining. Searching for spatial objects from spatial database has recently sparked enthusiasm among researchers. This motivated to develop methods to retrieve spatial objects. A spatial object consists of objects associated with spatial features. In other words, spatial objects involve spatial data along with longitude and latitude of location. Querying such data is called best keyword cover querying. Search is called best keyword cover search. Existing method to such data consider either minimum inter objective distance and keyword search. As a result new methods for best keyword cover search were developed. Traditional nearest neighbor search compute nearest neighbor by considering distance as feature. In this context, nearest neighbor search focus on finding nearest neighbors where keywords and spatial data plays a major impact. II. LITERATURE SURVEY Keyword augmented nearest neighbor search has recently sparked interest among researchers. Ke Deng [1] comes with algorithms to find nearest neighbor using keywords. Joao B Rocha [2] proposed spatial inverted index, a variant of inverted index to store keywords. Xin Cao [3] proposed the concept of collective spatial keyword querying. The central idea is to search for collective objects that collectively satisfy a query. Nearest neighbor search also comes under category of searching process. With this concept in mind, Gisli R [4] proposed distance browsing algorithm in spatial databases. Ronald Fagin [5] dealt with optimal aggregation algorithm which helps in fast keyword search.yufei Tao [6] proposed method for finding nearest neighbors using tree structure as index. Lisi Chen [7] provide a survey of indices to store keywords as well as spatial location. Xin Cao [8] dealt with various spatial keyword queries. The concept of Boolean range query fall under the category of spatial keyword query. DongXiang Zhang [9] proposed scalable integrated inverted index for storing spatial data. Bolin Ding [10] provides method to efficiently process keyword queries. Shuyao [11] proposed the concept of keyword query. Jianhua [12] considered a form of index named keyword pair based structure for finding top k answers using keyword search.. III. PROPOSED METHOD A. BASELINE ALGORITHM With increase in keywords for keyword search and nearest neighbor search, there is a need to perform query on such data. Baseline algorithm focus on retrieving data with respect to query keywords. Best keyword cover can be obtained by baseline algorithm. Baseline algorithm requires spatial objects in the form of files which include fields like spatial location and its document identifier and its address. Spatial objects are objects obtained from spatial data. All operations revolve around spatial objects. Input to baseline algorithm require single query keyword in the form of objects. The first step in baseline algorithm is to set a variable bkc as zero. The next @IJRTER-2017, All Rights Reserved 25

step is to generate candidate keyword cover. Candidate keyword cover generate spatial objects that contain those query keywords. Keyword significance has been calculated using term frequency inverse document frequency as similarity measure. Term frequency is the number of times a term occur in a single document divided by total number of terms in a document. Inverse document frequency is the inverse of number of times a term occurs in a document divided by total number of documents. Term frequency inverse document frequency is the product of term frequency and inverse document frequency. The default value is set as zero. The score obtained is compared with first score. If its value is greater than zero, it has been set as best keyword cover. Score calculation can be obtained as a pruning strategy. The next step is to perform nearest neighbor search upon candidate keyword covers generated. Nearest neighbor search algorithm has been computed using a traditional similarity measure named Euclidean distance. This similarity measure is based on distance. Nearest neighbor search algorithm sets its default value in terms of user s current user location. Based on that location, rest of distance with respect to that location has been calculated. The one least distance with respect to query location has been considered best keyword cover. When number of query keywords increases, its performance drops. It running time is very high. B. Keyword nearest neighbor expansion variant algorithm Baseline algorithm only retrieve answer to single query. It does not respond to multiple query keywords. It does not respond to spatial features. To overcome the drawback, the author proposed a variant of nearest neighbor algorithm called keyword nearest neighbor expansion variant. This algorithm focus on retrieving nearest neighbor by combining both keyword search and nearest neighbor search. The new algorithm introduce the concept of keyword rating, spatial relevance and keyword relevance. Keyword rating helps in decision making. For better decision making keyword rating play a significant role. Since all performance operations depend on objects, there exist a problem of choosing which objects first for querying when given multiple features of different objects. For this purpose keyword rating has been associated with objects. Rating is based day to day importance of object in daily life. Rating takes value of integer ranging from 1 to 5. This algorithm not only involve keyword rating but also involve features of objects as well. Objects must be selected to add features. Input to keyword nearest neighbor expansion variant algorithm is a set of query keywords in the form of features associated with objects. The first step is to select principle query keyword to perform search. In other words, to identify the first object in which feature has been associated for searching. Objects linked with principle query keyword are called principle objects. Indexing has been used to find required object associated with keyword. After identifying the object, it search for objects having highest keyword rating. The one with highest keyword rating are usually set as the first object in which search has to be carried out. IV. RESULT ANALYSIS The proposed method comes with two algorithms. Our experiment is based on real data. The dimensionality is always 2. Baseline algorithm applied on real data focus on retrieving data using single query keyword. Keyword nearest neighbor expansion variant algorithm retrieve data using multiple query keyword. Fig 1 shows bar chart representing execution time of baseline algorithm versus dataset count or number of files searched for a particular query keyword. Vertical axis indicate execution time during search process. Execution time is the difference between start time and run time when search procedure occur. When a single query keyword is searched in a file of thirty four, its execution time is 8433 milliseconds. Similarly, when searching takes in a file count of three for one query keyword, its execution time is 508 milliseconds. When searching takes place in a file count of twenty seven, execution time is 5071 milliseconds. It has been observed that execution time increases as files to be searched increases. File count is linearly proportional to execution time. Fig. 1 Execution time versus dataset count of baseline algorithm @IJRTER-2017, All Rights Reserved 26

Fig 2 shows bar chart indicating memory consumption of baseline graph. X axis indicate memory consumption time and Y axis indicate number of file or dataset count. When file searched is thirty four, memory required is 53727776 KB. When file searched is three for a particular query keyword, memory required is 41531088 KB. When file searched is twenty seven for a particular query keyword, memory required is 53282048 KB.. From this graph, it has been shown that as number of files searched increases memory required increases. So number of files searched is directly proportional to memory consumption. memory consumption required is 38417360 KB. When keyword cover count is two, memory consumption required is 41220920 KB. When keyword cover count is three, memory consumption required is 37853264 KB. As keyword cover count increases, memory consumption time varies. Fig. 4 Memory consumption versus keyword cover count of KNNE Variant algorithm From the experiments conducted, it has been observed that execution time required by keyword nearest neighbor expansion variant algorithm is much faster compared to baseline algorithm. V. CONCLUSION: Fig. 2 Memory consumption versus dataset count of baseline algorithm Fig 3 shows bar chart representing execution time versus keyword cover count of keyword nearest neighbor expansion variant algorithm. When keyword cover count is one, its execution time is 82 milliseconds. When keyword cover count is two, its execution time is 121 milliseconds. When keyword cover count is three, its execution time is 127 milliseconds. Fig. 3 Execution time versus keyword cover count of KNNE Variant algorithm Fig 4 shows bar chart indicating memory consumption required while executing multiple query keyword using keyword nearest neighbor expansion variant algorithm. When keyword cover count is one, A detailed report of two algorithms to retrieve best keyword cover was presented. Best keyword cover query aims to recover spatial objects with respect to user s requirement. Algorithms are used to find answer to such query. It also comes with best keyword cover route search which helps to find best route. REFERENCES [1] Ke Deng, Xin Li and Xiaofang Zhou, Best Keyword Cover Search, IEEE Transaction on Knowledge and Data Engineering,vol 27, no 1, January 2015. [2] Joao B Junior, Orestis Gkorghas, Simon Jonassen and Kjetil Norvag, Efficient Processing of Top K Spatial Keyword Queries, in SSTD, pages 205-222, 2012. [3] Xin Cao, Gao Cong, Beng Chin, Collective Spatial Keyword Querying, ACM Transaction on Database Systems, 2011. [4] Gilsi R Hjaltson and Hanseb Samet, Distance Browsing in Spatial Databases, ACM Transaction on Database Systems, June 1999, pp 265-318. [5] Ronald Fagin, Ammon Lotem and Moni Naor, Optimal Aggregation Algorithms for Middleware, Journal of Computer and System Sciences, April 2003. [6] Yufei Tao and Cheng Sheng, Fast Nearest Neighbor Search with Keywords, IEEE Transaction on Knowledge and Data Engineering, vol 26, no 4, April 2014. [7] Lisi Chen, Geo Cong, Christian S Jensen, Spatial Keyword Query ProcessingAnexperimental evaluation, Proceedings of the VLDB Endowment, vol 6, no 3, 2013. @IJRTER-2017, All Rights Reserved 27

[8] X. Cao, L Chen, Gao Cong, C. S Jensen, Spatial Keyword Querying, Information Forensics and Security, Springer 2012. [9] DongXiang Zhang, Kian Lee Tan, Anthony K. H. Tung,, Scalable Top K Spatial Keyword Search, ICDT, March 2015. [10] Bolin Ding, Bo Zhao, Efficient Keyword Based Search for Top K Cells in Text Cube, IEEE Transaction, December 2011. [11] Shuyao Qi, Dingming Wu, Nikos Mamoulis, Location Aware Keyword Query Suggestion Based on Document Proximity, IEEE Transaction on Knowledge anddata Engineering, vol 13, no 7, September 2014. @IJRTER-2017, All Rights Reserved 28