Mobility Data Management and Exploration: Theory and Practice

Similar documents
Detect tracking behavior among trajectory data

Spatiotemporal Access to Moving Objects. Hao LIU, Xu GENG 17/04/2018

An Efficient Technique for Distance Computation in Road Networks

Hermes - A Framework for Location-Based Data Management *

Mobility Data Management & Exploration

Contents. Part I Setting the Scene

Trajectory Compression under Network Constraints

Incremental Sub-Trajectory Clustering of Large Moving Object Databases

A Novel Indexing Method for BBx-Index structure

Update-efficient Indexing of Moving Objects in Road Networks

Spatio-temporal Access Methods

Mobility Data Mining. Mobility data Analysis Foundations

Incremental Nearest-Neighbor Search in Moving Objects

Similarity-based Analysis for Trajectory Data

Approximate Evaluation of Range Nearest Neighbor Queries with Quality Guarantee

Offline Approaches for Preserving Privacy of Trajectories on the Road Networks

DSTTMOD: A Discrete Spatio-Temporal Trajectory Based Moving Object Database System

Towards a Taxonomy of Location Based Services

Data mining and warehousing for Temporal Data Objects Kola Surya Prakash Asst Prof Computer Science Tagore Arts College, Lawspet, puducherry

DS504/CS586: Big Data Analytics Data Management Prof. Yanhua Li

Continuous Query Processing in Spatio-temporal Databases

Mobile Location Systems, Tools, and Services

Algebras for Moving Objects and Their Implementation*

A Spatio-temporal Access Method based on Snapshots and Events

Moving Object indexing using Crossbreed Update

Nearest Neighbor Search on Moving Object Trajectories

Implementation of Spatio-Temporal Data Types with Java Generics

Nearest Neighbor Search on Moving Object Trajectories

Continuous Density Queries for Moving Objects

Comparative Analysis of Proposed POBBx-Index Structure

Design Considerations on Implementing an Indoor Moving Objects Management System

Close Pair Queries in Moving Object Databases

Where Next? Data Mining Techniques and Challenges for Trajectory Prediction. Slides credit: Layla Pournajaf

A System for Discovering Regions of Interest from Trajectory Data

A cost model for spatio-temporal queries using the TPR-tree

Searching Similar Trajectories in Real Time: an Effectiveness and Efficiency Study *

Pointwise-Dense Region Queries in Spatio-temporal Databases

Managing Uncertain Spatio-Temporal Data

Chapter 1, Introduction

Trajectory Compression under Network constraints

Data Structures for Moving Objects on Fixed Networks

IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 3, No 1, May 2012 ISSN (Online):

Trajectory Queries and Octagons in Moving Object Databases

R-trees with Update Memos

Seismo-Surfer: A Prototype for Collecting, Querying, and Mining Seismic Data

Kyriakos Mouratidis. Curriculum Vitae September 2017

Probabilistic Spatial Queries on Existentially Uncertain Data

Continuous Evaluation of Monochromatic and Bichromatic Reverse Nearest Neighbors

Dynamic Nearest Neighbor Queries in Euclidean Space

Overview of Data Management

Voronoi-based Trajectory Search Algorithm for Multi-locations in Road Networks

Nearest Neighbor Search on Moving Object Trajectories

9/23/2009 CONFERENCES CONTINUOUS NEAREST NEIGHBOR SEARCH INTRODUCTION OVERVIEW PRELIMINARY -- POINT NN QUERIES

PA-Tree: A Parametric Indexing Scheme for Spatio-temporal Trajectories

Quadrant-Based MBR-Tree Indexing Technique for Range Query Over HBase

Query Optimization for Spatio-temporal Data Stream Management Systems

SEST L : An Event-Oriented Spatio-Temporal Access Method

Data Engineering. June 2002 Vol. 25 No. 2 IEEE Computer Society

Processing Skyline Queries in Temporal Databases

Dynamic Monitoring of Moving Objects: A Novel Model to Improve Efficiency, Privacy and Accuracy of the Framework

Question Bank. 4) It is the source of information later delivered to data marts.

TRAJECTORY PATTERN MINING

Situational Awareness over Large Spatio-Temporal Databases

MobiPLACE*: A Distributed Framework for Spatio-Temporal Data Streams Processing Utilizing Mobile Clients Processing Power.

Continuous Spatiotemporal Trajectory Joins

SPATIAL RANGE QUERY. Rooma Rathore Graduate Student University of Minnesota

R-trees with Update Memos

Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index

Data Warehousing and Data Mining. Announcements (December 1) Data integration. CPS 116 Introduction to Database Systems

RESEARCH ON UNIFIED SPATIOTEMPORAL DATA MODEL

On Discovering Moving Clusters in Spatio-temporal Data

The Design of a Spatio-Temporal Database to investigate on sex offenders

Data Mining & Data Warehouse

Indexing the Positions of Continuously Moving Objects

Semantic Optimization of Preference Queries

CHOROCHRONOS: Research on Spatiotemporal Database Systems

Abstract. 1. Introduction

Indexing the Trajectories of Moving Objects in Networks

arxiv: v1 [cs.db] 9 Mar 2018

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

Continuous Intersection Joins Over Moving Objects

Effective Density Queries on Continuously Moving Objects

Rapporto di Ricerca CS Frequent Spatio-Temporal Patterns in Trajectory Data Warehouses

A Novel Method to Estimate the Route and Travel Time with the Help of Location Based Services

Mining Trajectory Patterns Using Hidden Markov Models

System Energy Efficiency Lab seelab.ucsd.edu. Jinseok Yang

Moving Objects Analytics: Survey on Future Location & Trajectory Prediction Methods

On-Line Discovery of Dense Areas in Spatio-temporal Databases

Introduction to Temporal Database Research. Outline

Scalable Selective Traffic Congestion Notification

SPATIOTEMPORAL INDEXING MECHANISM BASED ON SNAPSHOT-INCREMENT

Data Warehousing & Mining. Data integration. OLTP versus OLAP. CPS 116 Introduction to Database Systems

Department of Information Technology B.E/B.Tech : CSE/IT Regulation: 2013 Sub. Code / Sub. Name : CS6302 Database Management Systems

Optimizing Moving Queries over Moving Object Data Streams

must uncertainty interval of object 2 uncertainty interval of object 1

Clustering Spatio-Temporal Patterns using Levelwise Search

SQL Server Analysis Services

Modeling Historical and Future Spatio-Temporal Relationships of Moving Objects in Databases

Introduction to Databases CS348

Hidden Markov Models. Slides adapted from Joyce Ho, David Sontag, Geoffrey Hinton, Eric Xing, and Nicholas Ruozzi

Transcription:

Mobility Data Management and Exploration: Theory and Practice Chapter 4 -Mobility data management at the physical level Nikos Pelekis & Yannis Theodoridis InfoLab, University of Piraeus, Greece infolab.cs.unipi.gr A guided tour on Mobility Data Mgmt I. Mobility data storage and querying Efficient trajectory indexing and storage in MODs II. Mobility-aware applications and tools Location-based services and tools; Algorithms and operations for LBS III. Mobility data analysis and mining Trajectory warehousing and OLAP; Mobility data mining and reasoning; Visual analytics for mobility data IV. Privacy aspects Preserving user traces anonymity V. Outlook Open issues Future Challenges 2 1

Mobility data storage and querying πάνταρει - Everything changes and nothing remains still Heraclitus (500 BC) 3 Key questions that arise How to reconstruct a trajectory from raw logs? How to store and query trajectoriesin a DBMS? Is a trajectory simply a sequence of (x, y, t) tuples? What kind of analysis is suitable for mobility data? In particular, trajectories of moving objects? How does infrastructure (e.g. road network) affect this analysis? Which patterns / models can be extracted out of them? Clusters, frequent patterns, anomalies / outliers, etc. How to compute such patterns / models efficiently? How to protect privacy / anonymity? trade-off between privacy protection and quality of analysis 4 2

Efficiently trajectory indexing and storage in MODs Indexing techniques MOD engines 5 R-trees for spatial data For d-dimensional point or region data Is it portable to mobility data? In other words, is space + time simply a 3D space? 6 3

What?? for mobility data Challenges: Both space and time are equally important! But! time is not simply a 3 rd dimension think of its monotonicity Also, density in the spatio-temporal space could be quite different from place to place Diversity in space, in time, in space-time 7 Diversity in space Time: 05/06/2008 05:03 pm The number of objects varies in different regions 8 4

Diversity in time Time: 05/06/2008 05:03 pm Time: 05/06/2008 08:05 am The number of objects changes with time. 9 Diversity in space & time Time: 05/06/2008 05:03 pm Time: 05/06/2008 08:05 am The distribution of objects also changes with time 10 5

Back to technical stuff Assume we have stored trajectories (i.e., the recorded locations of a moving object over time) Main Question: how can we approximate a trajectory? Like, e.g. MBRs for (static) spatial objects? Then, we could build indices upon the approximations x y time 11 Two approaches: native vs. parametric space Typically approximate using MBRs; then index these MBRs we can use R-trees etc. trajectories are lines, thus MBRs add extensive empty space How many MBRs per trajectory? One MBR per trajectory (too much empty space ) or one MBR per segment (too many MBRs ) Can we do anything better? Smart partitioning for MBRs [Hadjieleftheriou et al. 2002]...... 12 6

MOD indexing techniques Indexing the past (= trajectories) unconstrained movement : the trajectory-bundle tree (TB-tree) [Pfoseret al. 2000] network-constrained movement: the fixed-network-restricted tree (FNR-tree) [Frentzos, 2003] Indexing the present (and anticipated future) Data partitioning: TPR-tree [Saltenis et al. 2000], TPR*-tree [Tao et al. 2003] Space partitioning: B x -tree [Jensen et al. 2004], ST 2 B-tree [Chen et al. 2008] (Hybrid solution for) Indexing the past & present R PPF -tree [Pelaniset al. 2006] We focus on historical tracks of moving objects (trajectories) concentrate on the 1 st group 13 TB-tree [Pfoser et al. 2000] Maintains the trajectory concept Each node consists of segments of a single trajectory nodes corresponding to the same trajectory are linked together in a chain t11 Effective for trajectory-oriented queries t7 t3 t1 t1 t2 t3 t4 t5 t6 t7 t8 t9 t10 t11 t12 14 7

FNR-tree (Frentzos, 2003) a forest of 1D (temporal) R-trees on top of a 2D (spatial) R-tree There is an additional Parent 1D R-tree which indexes the temporal intervals of the 1D R-trees leaf nodes 2D R-tree (spatial) Children 1D R-trees (temporal) P arent 1 D R -tree (te m po ra l) 15 Moving Objects Database Systems From traditional DBMS to Moving Object Database (MOD) engines Data types, indices, query processing & optimization strategies for trajectories Spatial and temporal dimensions are considered as first-class citizens. Several prototype MOD engines SECONDO(Güting et. al.) IDEAS 00, ICDE 05, MDM 06 PLACE(Aref et al.) SSDBM 04, VLDB 04 HERMES(Pelekis et. al.) EDBT 06, SIGMOD 08, IJKBO 11 16 8

SECONDO http://dna.fernuni-hagen.de/secondo.html/index.html A generic DBMS framework that can be filled with implementation of various data models (R, OR, XML) and data types (spatial data, moving objects) Built on top of Berkeley DB. A MOD is a set of SECONDO objects of the form (name, type, value), where type is one of the implemented algebras About 20 implemented algebras standard algebra, relational algebra, R-Tree algebra, spatial algebra, etc. Query optimizer includes optimization of conjunctive queries, selectivity estimation, and implementation of an SQL-like query language 17 The PLACE Server http://www.cs.purdue.edu/place/ Continuousevaluation of queries over spatio-temporal data streams Shared execution among concurrent continuous queries Built on top of PREDATOR database system Incremental evaluation of continuous queries Spatio-temporal query operators 18 9

The Hermes MOD engine http://infolab.cs.unipi.gr/hermes/ A palette of Abstract Data Types on top of an extensible DBMS Moving point, moving line, moving polygon, etc. Linear vs. Arc movement R-tree and TB-tree indexing support xx' t ε [t1, t2) -> Linear movement t ε [t2, t3) -> Arc movement t ε [t3, t4) -> Const movement t ε [t4, t5) -> Linear movement tt' yy' t1 t2 t3 t4 t5 19 Summary 20 10

Summary on Mobility Data Management Open issues: Trajectory indexing Hybrid(past- and present- locations) indexing techniques Advanced MOD architectures centralized vs. distributed vs. stream-oriented 21 Questions 22 11

Reading list Mobility data modeling & MOD engines de Almeida, V.T. et al. (2006) Querying Moving Objects in SECONDO. Proceedings of MDM. Behr, T. and Güting, R.H. (2005) Fuzzy Spatial Objects: An Algebra Implementation in SECONDO. Proceedings of ICDE. Cao, H. and Wolfson, O. (2005) NonmaterializedMotion Information in Transport Networks. Proceedings of ICDT. Chen, C.X. and Zaniolo, C. (2000) SQLST: A Spatio-Temporal Data Model and Query Language. Proceedings of ER. Cheng, R. et al. (2004) Efficient Indexing Methods for Probabilistic Threshold Queries over Uncertain Data. Proceedings of VLDB. Dieker, S. and Güting, R.H. (2000) Plug and Play with Query Algebras: SECONDO A Generic DBMS Development Environment. Proceedings of IDEAS. 23 Reading list Mobility data modeling & MOD engines (cont.) Güting, R.H. et al. (2000) A Foundation for Representing and Querying Moving Objects. ACM Transactions on Database Systems, 25(1):1-42. Güting, R.H. et al. (2006) Modeling and querying moving objects in networks. VLDB Journal, 15(2): 165-190. Karimi, H. and Liu, X. (2003) A Predictive Location Model for Location-Based Services. Proceedings of ACM-GIS. Marketos, G. et al. (2008) Building real-world trajectory warehouses. Proceedings of MobiDE. Mokbel, M.F. et al. (2004a) Continuous Query Processing of Spatio-temporal Data Streams in PLACE. Proceedings of SSDBM. Mokbel, M.F. et al. (2004b) PLACE: A Query Processor for Handling Real-time Spatio-temporal Data Streams. Proceedings of VLDB. 24 12

Reading list Mobility data modeling & MOD engines (cont.) Mokhtar, H., and Su, J. (2005) A Query Language for Moving Object Trajectories. Proceedings of SSDBM. Patroumpas, K. and Sellis, T.K. (2004) Managing Trajectories of Moving Objects as Data Streams. Proceedings of STDBM. Pelekis, N. and Theodoridis, Y. (2007) An Oracle Data Cartridge for Moving Objects. Technical Report, TR-2007-04, University of Piraeus. Pelekis, N. et al. (2004) Literature Review of Spatio-temporal Database Models. Knowledge Engineering Review, 19(3): 235-274. Pelekis, N. et al. (2006) Hermes -A Framework for Location-Based Data Management. Proceedings of EDBT. Pelekis, N. et al. (2008) HERMES: aggregative LBS via a trajectory DB engine. Proceedings of ACM SIGMOD. Pelekis, N. et al. (2011) HERMES: A Trajectory DB Engine for Mobility-Centric Applications, International Journal of Knowledge-based Organizations, in press.. 25 Reading list MOD Indexing Cai, Y. and Ng, R.T. (2004) Indexing Spatio-Temporal Trajectories with Chebyshev Polynomials. Proceedings of ACM SIGMOD. Chen, S. et al. (2008) ST 2 B-tree: A Self-Tunable Spatio-Temporal B+-tree Index for Moving Objects. Proceedings of ACM SIGMOD. Frentzos, E.(2003)Indexing Objects Moving on Fixed Networks.Proceedings of SSTD. Hadjieleftheriou, M. et al. (2006) Indexing Spatio-temporal Archives. VLDB Journal, 15(2): 143-164. Jensen, C. S. et al. (2004) Query and update efficient B + -tree based indexing of moving objects. Proceedings of VLDB. Kollios, G. et al. (2001) Indexing Animated Objects Using Spatiotemporal Access Methods. IEEE Trans. Knowledge and Data Engineering, 13(5): 758-777. 26 13

Reading list MOD Indexing (cont.) Myllymaki, J. and Kaufman, J. (2003) High-Performance Spatial Indexing for Location-Based Services. Proceedings of WWW. Ni, J. and Ravishankar, C.V. (2007)Indexing Spatio-Temporal Trajectories with Efficient Polynomial Approximations.IEEE Trans. Knowledge and Data Engineering, 19(5): 663-678. Pelanis, M. et al. (2006). Indexing the past, present, and anticipated future positions of moving objects. ACM Trans. Database Systems, 31(1): 255-298. Pfoser,D. et al. (2000)Novel Approaches to the Indexing of Moving Object Trajectories. Proceedings of VLDB. Rasetic, S. et al. (2005) A Trajectory Splitting Model for Efficient Spatio- Temporal Indexing. Proceedings of VLDB. 27 Reading list MOD Indexing (cont.) Saltenis, S. et al. (2000) Indexing the Positions of Continuously Moving Objects. Proceedings of ACM SIGMOD. Saltenis, S. and C.S. Jensen (2002) Indexing of Moving Objects for Location- Based Services. Proceedings of ICDE. Tao, Y. and Papadias, D. (2001) MV3R-Tree: A Spatio-Temporal Access Method for Timestamp and Interval Queries. Proceedings of VLDB. Tao, Y. et al. (2003) The TPR*-tree:an optimized spatio-temporal access method for predictive queries. Proceedings of VLDB. 28 14