Detect tracking behavior among trajectory data

Similar documents
Mobility Data Management and Exploration: Theory and Practice

An Efficient Technique for Distance Computation in Road Networks

DS504/CS586: Big Data Analytics Data Management Prof. Yanhua Li

Voronoi-based Trajectory Search Algorithm for Multi-locations in Road Networks

Spatiotemporal Access to Moving Objects. Hao LIU, Xu GENG 17/04/2018

Constructing Popular Routes from Uncertain Trajectories

Similarity-based Analysis for Trajectory Data

DS595/CS525: Urban Network Analysis --Urban Mobility Prof. Yanhua Li

9/23/2009 CONFERENCES CONTINUOUS NEAREST NEIGHBOR SEARCH INTRODUCTION OVERVIEW PRELIMINARY -- POINT NN QUERIES

Introduction to Trajectory Clustering. By YONGLI ZHANG

Where Next? Data Mining Techniques and Challenges for Trajectory Prediction. Slides credit: Layla Pournajaf

arxiv: v1 [cs.db] 9 Mar 2018

A System for Discovering Regions of Interest from Trajectory Data

A Joint approach of Mining Trajectory Patterns according to Various Chronological Firmness

On-Line Discovery of Flock Patterns in Spatio-Temporal Data

Clustering Moving Objects in Spatial Networks

Incremental Sub-Trajectory Clustering of Large Moving Object Databases

On Discovering Moving Clusters in Spatio-temporal Data

Close Pair Queries in Moving Object Databases

Update-efficient Indexing of Moving Objects in Road Networks

M Thulasi 2 Student ( M. Tech-CSE), S V Engineering College for Women, (Affiliated to JNTU Anantapur) Tirupati, A.P, India

Mobility Data Management & Exploration

A Unified Framework for Authenticating Privacy Preserving Location Based Services

Visual Traffic Jam Analysis based on Trajectory Data

Efficient Construction of Safe Regions for Moving knn Queries Over Dynamic Datasets

Mobility Data Mining. Mobility data Analysis Foundations

Pointwise-Dense Region Queries in Spatio-temporal Databases

Dynamic Nearest Neighbor Queries in Euclidean Space

A Novel Indexing Method for BBx-Index structure

On Discovering Moving Clusters in Spatio-temporal Data

Retrieving k-nearest Neighboring Trajectories by a Set of Point Locations

An Edge-Based Algorithm for Spatial Query Processing in Real-Life Road Networks

Probabilistic Spatial Queries on Existentially Uncertain Data

Searching for Similar Trajectories on Road Networks using Spatio-Temporal Similarity

Hermes - A Framework for Location-Based Data Management *

A FRAMEWORK FOR MINING UNIFYING TRAJECTORY PATTERNS USING SPATIO- TEMPORAL DATASETS BASED ON VARYING TEMPORAL TIGHTNESS Rahila R.

F2F: Friend-To-Friend Semantic Path Recommendation

Privacy-Preserving of Check-in Services in MSNS Based on a Bit Matrix

Continuous Density Queries for Moving Objects

Goal Directed Relative Skyline Queries in Time Dependent Road Networks

Trajectory Compression under Network Constraints

Spatial Keyword Search. Presented by KWOK Chung Hin, WONG Kam Kwai

Fosca Giannotti et al,.

Benchmarking Big Data for Trip Recommendation

Effective Density Queries on Continuously Moving Objects

Approximate Evaluation of Range Nearest Neighbor Queries with Quality Guarantee

TrajStore: an Adaptive Storage System for Very Large Trajectory Data Sets

Finding both Aggregate Nearest Positive and Farthest Negative Neighbors

A Novel Method for Activity Place Sensing Based on Behavior Pattern Mining Using Crowdsourcing Trajectory Data

Generating Traffic Data

A Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods

DSTTMOD: A Discrete Spatio-Temporal Trajectory Based Moving Object Database System

Mining Trajectory Data for Discovering Communities of Moving Objects

Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei National Chiao Tung University Hsinchu, Taiwan

Mining Trajectory Patterns Using Hidden Markov Models

Quadrant-Based MBR-Tree Indexing Technique for Range Query Over HBase

Searching Similar Trajectories in Real Time: an Effectiveness and Efficiency Study *

Using Parallel Spatial Mashup Model to Process k-nn Queries

Mining Representative Movement Patterns through Compression

Effective Density Queries for Moving Objects in Road Networks

A Novel Method to Estimate the Route and Travel Time with the Help of Location Based Services

Improvement of SURF Feature Image Registration Algorithm Based on Cluster Analysis

Mining Trajectory Data for Discovering Communities of Moving Objects

An Efficient Clustering Algorithm for Moving Object Trajectories

NOVEL CACHE SEARCH TO SEARCH THE KEYWORD COVERS FROM SPATIAL DATABASE

APPLICABILITY ANALYSIS OF CLOTH SIMULATION FILTERING ALGORITHM FOR MOBILE LIDAR POINT CLOUD

Continuous Evaluation of Monochromatic and Bichromatic Reverse Nearest Neighbors

Detecting Anomalous Trajectories and Traffic Services

Trajectory Compression under Network constraints

Mining Temporal Association Rules in Network Traffic Data

Nearest Neighbor Search on Moving Object Trajectories

An algorithm for Trajectories Classification

A NOVEL APPROACH ON SPATIAL OBJECTS FOR OPTIMAL ROUTE SEARCH USING BEST KEYWORD COVER QUERY

Nearest Neighbor Search on Moving Object Trajectories

Design Considerations on Implementing an Indoor Moving Objects Management System

On Reducing Communication Cost for Distributed Moving Query Monitoring Systems

ISSN: (Online) Volume 4, Issue 1, January 2016 International Journal of Advance Research in Computer Science and Management Studies

Top-k Keyword Search Over Graphs Based On Backward Search

Moving Object indexing using Crossbreed Update

Density Based Co-Location Pattern Discovery

International Journal of Scientific Research and Modern Education (IJSRME) Impact Factor: 6.225, ISSN (Online): (

Towards a Taxonomy of Location Based Services

Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce

TrajAnalytics: A software system for visual analysis of urban trajectory data

Continuous Spatiotemporal Trajectory Joins

Scalable Selective Traffic Congestion Notification

Tracking Network-Constrained Moving Objects with Group Updates

Computing Continuous Skyline Queries without Discriminating between Static and Dynamic Attributes

MobiPLACE*: A Distributed Framework for Spatio-Temporal Data Streams Processing Utilizing Mobile Clients Processing Power.

Survey on Recommendation of Personalized Travel Sequence

Efficient Index Based Query Keyword Search in the Spatial Database

Group Nearest Neighbor Queries for Fuzzy Geo-Spatial Objects

An Introduction to Spatial Databases

Best Keyword Cover Search

Aggregate-Max Nearest Neighbor Searching in the Plane

Nearest Neighbour Expansion Using Keyword Cover Search

Mining Frequent Trajectory Using FP-tree in GPS Data

Ranking Web Pages by Associating Keywords with Locations

Trajectory Voting and Classification based on Spatiotemporal Similarity in Moving Object Databases

Secure and Advanced Best Keyword Cover Search over Spatial Database

Transcription:

Detect tracking behavior among trajectory data Jianqiu Xu, Jiangang Zhou Nanjing University of Aeronautics and Astronautics, China, jianqiu@nuaa.edu.cn, jiangangzhou@nuaa.edu.cn Abstract. Due to the continuing improvements in location acquisition technology, a large population of GPS-equipped moving objects are tracked in a server. In emergency applications, users may want to detect whether a target is tracked by another object. We formulate the tracking behavior by continuous distance queries in trajectory databases. Index structures are developed to improve the query performance. Using real trajectories, we demonstrate answering continuous distance queries in a database system and animating moving objects fulfilling the distance condition in the user interface. The result benefits mining the interesting behavior among trajectory data and answering distance join queries. 1 Introduction The rise of GPS-equipped mobile devices has led to the emergence of big trajectory data. A large amount of historical trajectory data has become available for emerging applications such as traffic monitor and location-based services. Real time tracking has been studied to efficiently track an objects trajectory in real-time [12]. However, so far little attention has been paid to detect the tracking behavior over historical movement. That is, given a query trajectory o q and a distance threshold d, we aim to find whether there is any object keeping some distance to o q at each time point of o q. We formulate the problem by continuous distance queries over trajectory data in which users define a threshold to return objects within a distance (e.g., 500 meters) to the query. Consider an example in Figure 1. There are five trajectories{o 1,o 2,o 3,o 4,o 5 } and the task is to find whether o 3 is tracked by another object during [t 1, t 2 ]. Suppose that a short distance d is defined, e.g., 500m. In the example, we will find o 4 following o 3. Other objects are within the distance to o 3 but only for a short time period. This will not be treated as tracking behaivor. To find the tracker, we need to compute the timedependent distance between trajetories and determine whether the period fulfilling the distance condition contains each time point of the query trajectory. This is not a simple task because the precise distance needs to be determined. One may think of using the continuous nearest neighbor query to find the tracking behavior, but such a query will return objects with small distances at each piece of time. We need to figure out the precise distance and find whether there are objects that always keep a certain distance to the target rather than only at a short period. In the literature, a lot of effort has gone to process k nearest neighbor queries in which a distance function is defined to return k objects with the smallest distance to the target. Due to diverse spatial environments, e.g., free space [8], road network [13] and obstacle space [5], different distance functions are used. The distance is a static value

T o 1 o 3 d d o 5 t 2 t 1 o 2 Y o 4 X Fig. 1. Detect tracking behavior if a time point is considered and becomes a time-dependent function if the continuous query over a time period is considered. The latter is non-trivial because the distance varies over time such that returned objects change at certain points. In some special applications, the tracking behavior is helpful to find interesting objects that cannot be discovered by nearest neighbor queries. For example, to look for the tracker or travelers with common routes, we would like to search the object that always keeps a certain distance to the target rather than the nearest object at certain time points. One one hand, the nearest object to the target may change at some points or places. On the other hand, the nearest object is usually not the tracker because of being easily exposed. In the demo, we detect the tracking behavior by using continuous distance queries. To be general, the query is of three forms in terms of distance parameters. We are able to search objects within or out of a certain distance to the target. To efficiently answer the query, several trajectory indexes are provided in the system such as 3D R-tree and TB-tree. We propose a novel index structure that combines the grid index and R-tree. Given a query trajectory, one quickly calculates the cells within the query distance. During the R-tree traversal, we make use of the index structure to prune the space that cannot contribute to the result. Real datasets are used in the demonstration scenario for continuous distance queries and distance join queries. The rest of the paper is organized as follows. We review the related work in Section 2, formulate the problem and introduce the solution in Section 3, demonstrate continuous distance queries in Section 4, followed by conclusions in Section 5. 2 Related Work In the literature, tremendous efforts have been made on querying trajectories including nearest neighbors [6] [8], similarity search [4] [17] and pattern discovery [10] [14]. Those works focus on searching some targets close to the query, but do not consider a precise distance function to evaluate the query. Calibrating trajectory data is studied in [17], the goal of which is to transform heterogeneous trajectories to one with unified sampling strategies. Discoverying convoys in trajectory databases is to return groups of objects such that each group consists of a set of density-connected objects which should satisfy the distance requirement over a certain time period. Trajectory clustering

[11] [15] [9] also aims to form groups such that (i) points within the same group are close to each other, and (ii) points from different groups are far apart. However, the distance in convoy and cluster queries is calculated at each time point but not a timedependent function as we use for processing continuous queries. In addition, discovery of convoys and clusters in trajectory database shows the behavior for a group of objects. In contrast, we aim to find whether a target is tracked by another object, which is an individual behavior. The closest-point-of-approach join is studied in [18] [2], the task of which is to return all object pairs. Each pair of trajectories has the distance less than a threshold at some point in time. This is not a continuous query and cannot be used to detect the tracking behavior. Two objects close to each other at some point in time do not mean that one is tracked by another. Probably this is treated as passby or overtaken. We generalize the distance query by considering a time interval such that the distance between two trajetories is less than a certain value at each time point during the query. 3 The Framework 3.1 Problem definition Let the database O be a set of trajectories, each of which consists of a sequence of temporal units. Each unit represents the movement over a time interval by recording start and end locations. Locations between them are estimated by interpolation. Given two trajectorieso 1, o 2 O, we let dist(o 1, o 2, t) return the distance betweeno 1 ando 2 at time pointt. Definition 31 Tracking behavior LetT(o)(o O) return the time period of a trajectory. Given a query trajectoryo q O and a distanced, we defineo q s tracker by Tracker(o q ) O : t T(o q ), dist(o q, Tracker(o q ),t) < d We generalize the query by defining a distance range, i.e., d = [d 1, d 2 ]. If d 1 = 0 and d 2 > 0, this is used to find the tracking behavior. If 0 < d 1 < d 2, we can return objects between d 1 and d 2 to the target which may also be trackers. If d 1 > 0 and d 2 =, objects that are further than a certain distance to the target are found and can be used to exclude suspicious objects. 3.2 The solution An outline of the solution is provided in Figure 2. To efficiently find trajectories within a certain distance to the target, an index structure is essentially needed. We have implemented well established trajectory indexes including TB-tree [16], 3D R-tree and Grid index [3]. We perform a traversal on the indexes to retrieve trajectories that approximately belong to the result and then do the exact distance computation. The distance function over a time period is represented by a parabola function. We will receive pieces of distance functions and split trajectories at certain points to restrict the movement

Moving objects animation Algorithms for continuous distance queries dist d 2 split point TB-tree 3D R-tree Grid index Grid R-tree partition the trajectory d 1 Trajectory storage (a) The architecture t 0 t 1 t 2 t 3 (b) Distance curve t Fig. 2. An overview fulling the query condition. The method is able to find the tracking behavior but can be further optimized. To enhance the performance, we propose an index structure that combines the grid index and R-tree, as shown in Figure 3. The 2D space is partition into a set of equalsize cells and trajectories are decomposed according to cells. That is, each trajectory is split into pieces each of which is completely inside a cell. We sort trajectory pieces by cell id, time and spatial. Each leaf R-tree node only contains trajectories from the same cell. The query procedure will make use of the cells to prune the search space. We first determine the cells in which the query trajectory is located. Since cells have the same size, we can quickly calculate target cells which are within the specified distance to the query trajectory. Target cells will be used to prune R-tree nodes that do not cover target cells because only trajectories located in target cells will contribute to the result. cell a leaf node units in the same grid sorted units (a) Grid (b) Grid R-tree nodes Fig. 3. Grid R-tree 4 Demonstration The implementation is developed in an extensible database system SECONDO [7] and program in C/C++. We use real datasets from a data company DataTang [1] including 1,675,667 GPS records of Beijing taxis. There are 22,269 trajectories in total. In the demonstration, we randomly select a trajectory from the dataset and search trajectories keeping the specified distance to the query during its whole life time. Two

distance parameters are defined: (i) d = [0m, 5000m]; and (ii) d = [5000m, 20000m] and the results are displayed in Figure 4. We are able to animate moving objects in the java interface such that one can observe how returned objects change over time. If an object always keeps the distance to the query, then it is the tracker. (a) within 5km Fig. 4. Continuous distance queries (b) between 5km and 20km Distance join queries. We also demonstrate distance join queries over trajectories, as illustrated in Figure 5. The result is a set of pairs of trajectories such that each pair has the period of the intersection time more than a certain value (e.g., 15 min) and the distance always less than a threshold during the period. This would help finding common traveling routes among different travelers and discovering potential relationships such as neighbor and colleague. Fig. 5. Distance join queries 5 Conclusions We study detecting the tracking behavior over trajectory data and formulate the problem by continuous distance queries. The framework of processing the queries is introduced

including partitioning the data, building index structures, developing query algorithms and animating moving objects in the user interface. We demonstrate answering continuous distance and distance join queries in a prototype database system. The future work is to evaluate the system performance by using big trajetory data and flexibly visualize a large number of trajectories in the user interface. Acknowledgment This work is supported by NSFC under grant numbers 61300052 and 61373015. References 1. http://factory.datatang.com/en/ (2016). 2. S. Arumugam and C. Jermaine. Closest-point-of-approach join for moving object histories. In ICDE, page 86, 2006. 3. V. P. Chakka, A. Everspaugh, and J. M. Patel. Indexing large trajectory data sets with seti. In CIDR, 2003. 4. L. Chen, M. T. Özsu, and V. Oria. Robust and fast similarity search for moving object trajectories. In SIGMOD, pages 491 502, 2005. 5. Y. Gao and B. Zheng. Continuous obstructed nearest neighbor queries in spatial databases. In ACM SIGMOD, pages 577 590, 2009. 6. Y. Gao, B. Zheng, G. Chen, and Q. Li. Algorithms for constrained k-nearest neighbor queries over moving object trajectories. GeoInformatica, 14(2):241 276, 2010. 7. R. H. Güting, T. Behr, and C. Düntgen. SECONDO: A platform for moving objects database research and for publishing and integrating research implementations. IEEE Data Eng. Bull., 33(2):56 63, 2010. 8. R. H. Güting, T. Behr, and J. Xu. Efficient k-nearest neighbor search on moving object trajectories. VLDB J., 19(5):687 714, 2010. 9. C.C. Hung, W.C. Peng, and W.C. Lee. Clustering and aggregating clues of trajectories for mining trajectory patterns and routes. VLDB J., 24(2):169 192, 2015. 10. H. Jeung, M.L. Yiu, X. Zhou, C. S. Jensen, and H.T. Shen. Discovery of convoys in trajectory databases. PVLDB, 1(1):1068 1080, 2008. 11. P. Kalnis, N. Mamoulis, and S. Bakiras. On discovering moving clusters in spatio-temporal data. In SSTD, pages 364 381, 2005. 12. R. Lange, F. Dürr, and K. Rothermel. Efficient real-time trajectory tracking. VLDB J., 20(5):671 694, 2011. 13. Y. Li, J. Li, L. Shu, Q. Li, G. Li, and F. Yang. Searching continuous nearest neighbors in road networks on the air. Inf. Syst., 42:177 194, 2014. 14. Z. Li, B. Ding, J. Han, and R. Kays. Swarm: Mining relaxed temporal moving object clusters. PVLDB, 3(1):723 734, 2010. 15. N. Pelekis, P. Tampakis, M. Vodas, C. Panagiotakis, and Y. Theodoridis. In-dbms samplingbased sub-trajectory clustering. In EDBT 2017, pages 632 643, 2017. 16. D. Pfoser and C.S. Jensen. Novel approaches in query processing for moving object trajectories. In VLDB, pages 395 406, 2000. 17. H. Su, K. Zheng, H. Wang, J. Huang, and X. Zhou. Calibrating trajectory data for similaritybased analysis. In SIGMOD, pages 833 844, 2013. 18. P. Zhou, D. Zhang, B. Salzberg, G. Cooperman, and G. Kollios. Close pair queries in moving object databases. In ACM-GIS, pages 2 11, 2005.