Contents. Part I Setting the Scene

Similar documents
Business Intelligence Roadmap HDT923 Three Days

Contents. Foreword to Second Edition. Acknowledgments About the Authors

Time: 3 hours. Full Marks: 70. The figures in the margin indicate full marks. Answers from all the Groups as directed. Group A.

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

Code No: R Set No. 1

Evolving To The Big Data Warehouse

Mobility Data Management and Exploration: Theory and Practice

Question Bank. 4) It is the source of information later delivered to data marts.

Table Of Contents: xix Foreword to Second Edition

Big Trend in Business Intelligence: Data Mining over Big Data Web Transaction Data. Fall 2012

Publishing CitiSense Data: Privacy Concerns and Remedies

Tribhuvan University Institute of Science and Technology MODEL QUESTION

Getting more from your Engineering Data. John Chapman Regional Technical Manager

Chapter 1, Introduction

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?

Chapter 3. Foundations of Business Intelligence: Databases and Information Management

Data Warehousing & Mining. Data integration. OLTP versus OLAP. CPS 116 Introduction to Database Systems

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER

TIM 50 - Business Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

"Charting the Course... MOC B Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course Summary

Capture Business Opportunities from Systems of Record and Systems of Innovation

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT

Part I: Data Mining Foundations

DOTNET PROJECTS. DOTNET Projects. I. IEEE based IOT IEEE BASED CLOUD COMPUTING

Table of Contents 1 Introduction A Declarative Approach to Entity Resolution... 17

High-Performance Parallel Database Processing and Grid Databases

Data Warehousing and Data Mining. Announcements (December 1) Data integration. CPS 116 Introduction to Database Systems

PATTERN CLASSIFICATION AND SCENE ANALYSIS

Foundations of SQL Server 2008 R2 Business. Intelligence. Second Edition. Guy Fouche. Lynn Lang it. Apress*

Chapter 6 VIDEO CASES

International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.7, No.3, May Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani

Data Management Glossary

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

2. (a) Briefly discuss the forms of Data preprocessing with neat diagram. (b) Explain about concept hierarchy generation for categorical data.

Privacy-Preserving. Introduction to. Data Publishing. Concepts and Techniques. Benjamin C. M. Fung, Ke Wang, Chapman & Hall/CRC. S.

Complete. The. Reference. Christopher Adamson. Mc Grauu. LlLIJBB. New York Chicago. San Francisco Lisbon London Madrid Mexico City

Applying big data analytics in practice

What is Gluent? The Gluent Data Platform

The University of Iowa Intelligent Systems Laboratory The University of Iowa Intelligent Systems Laboratory

Learning Objectives for Data Concept and Visualization

Computational Databases: Inspirations from Statistical Software. Linnea Passing, Technical University of Munich

SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE)

Introduction to Data Mining and Data Analytics

Data Mining. Yi-Cheng Chen ( 陳以錚 ) Dept. of Computer Science & Information Engineering, Tamkang University

Chapter 6. Foundations of Business Intelligence: Databases and Information Management VIDEO CASES

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING

Dr.G.R.Damodaran College of Science

IEEE Project Titles

Big Data with Hadoop Ecosystem

DATA MINING AND WAREHOUSING

Data Warehousing & Mining Techniques for Moving Object Databases

Data Warehouse and Data Mining

Oracle Big Data Connectors

(All chapters begin with an Introduction end with a Summary, Exercises, and Reference and Bibliography) Preliminaries An Overview of Database

Mobility Data Management & Exploration

CONTENTS. Computer-System Structures

Hive SQL over Hadoop

@Pentaho #BigDataWebSeries

TIM 50 - Business Information Systems

Oracle Big Data Fundamentals Ed 2

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools

Adding Meaning to your Steps

Data Warehouse and Data Mining

Lectures for the course: Data Warehousing and Data Mining (IT 60107)

DATA WAREHOUING UNIT I

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

20466C - Version: 1. Implementing Data Models and Reports with Microsoft SQL Server

Database and Knowledge-Base Systems: Data Mining. Martin Ester

by Prentice Hall

SAP CERTIFIED APPLICATION ASSOCIATE - SAP HANA 2.0 (SPS01)

Data Mining and Data Warehousing Introduction to Data Mining

Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer

a linear algebra approach to olap

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research

Section A. 1. a) Explain the evolution of information systems into today s complex information ecosystems and its consequences.

Contents. Preface to the Second Edition

TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. ABSTRACT 5 LIST OF TABLES LIST OF FIGURES LIST OF SYMBOLS AND ABBREVIATIONS xxi

Microsoft Analytics Platform System (APS)

Data Warehouse and Data Mining

SAS Data Integration Studio 3.3. User s Guide

9. Conclusions. 9.1 Definition KDD

Hermes - A Framework for Location-Based Data Management *

Time Distortion Anonymization for the Publication of Mobility Data with High Utility

BIG DATA SCIENTIST Certification. Big Data Scientist

TABLE OF CONTENTS CHAPTER NO. TITLE PAGENO. LIST OF TABLES LIST OF FIGURES LIST OF ABRIVATION

Blended Learning Outline: Cloudera Data Analyst Training (171219a)

Management Information Systems Review Questions. Chapter 6 Foundations of Business Intelligence: Databases and Information Management

Data Mining and Warehousing

Creating Connection With Hive. Version: 16.0

Oracle #1 RDBMS Vendor

1 (eagle_eye) and Naeem Latif

R07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis.

DATA MINING TRANSACTION

Data Mining & Data Warehouse

Microsoft End to End Business Intelligence Boot Camp

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

Chapter 28. Outline. Definitions of Data Mining. Data Mining Concepts

Transcription:

Contents Part I Setting the Scene 1 Introduction... 3 1.1 About Mobility Data... 3 1.1.1 Global Positioning System (GPS)... 5 1.1.2 Format of GPS Data... 6 1.1.3 Examples of Trajectory Datasets... 8 1.2 What Can We Learn from Mobility Data... 9 1.3 Location- and Mobility-Aware Applications... 12 1.4 Adding Mobility in Spatial Database Systems... 15 1.5 Summary... 17 1.6 Exercises... 17 1.7 Bibliographical Notes and Online Resources... 17 References... 19 2 Background on Spatial Data Management and Exploration... 21 2.1 Spatial Data Modeling... 21 2.2 Spatial Database Management... 25 2.2.1 Abstract Data Types... 25 2.2.2 Indexing and Query Processing Issues... 26 2.3 Spatial Data Warehousing... 32 2.4 Spatial Data Mining... 34 2.4.1 Cluster Analysis... 35 2.4.2 Co-Location Pattern Mining... 39 2.5 Data Privacy Aspects... 39 2.6 Summary... 42 2.7 Exercises... 43 2.8 Bibliographical Notes... 44 References... 46 xi

xii Contents Part II Mobility Data Management 3 Modeling and Acquiring Mobility Data... 51 3.1 Modeling Mobility Data... 51 3.2 Acquiring Trajectories from Raw Data... 54 3.2.1 GPS Data Cleansing... 56 3.2.2 Trajectory Identification... 57 3.3 Trajectory Reconstruction and Simplification... 59 3.3.1 Trajectory Reconstruction via Map-Matching... 59 3.3.2 Trajectory Simplification via Data Compression... 63 3.4 Trajectory Data Generators... 64 3.4.1 Generating Trajectories in Free Space... 65 3.4.2 Generating Network-Constrained Trajectories... 65 3.5 Summary... 69 3.6 Exercises... 70 3.7 Bibliographical Notes... 70 References... 72 4 Mobility Database Management... 75 4.1 Location- and Mobile-Aware Querying... 75 4.1.1 Location-Oriented Queries... 77 4.1.2 Trajectory-Oriented Queries... 78 4.1.3 Querying Under Uncertainty... 79 4.2 Indexing Techniques for Mobility Data... 81 4.2.1 Indexing Trajectories in Free Space... 82 4.2.2 Indexing Network-Constrained Trajectories... 85 4.3 Query Processing Techniques... 87 4.3.1 Processing Location-Oriented Queries... 87 4.3.2 Processing Trajectory-Oriented Queries... 89 4.4 Benchmarks... 91 4.5 Summary... 93 4.6 Exercises... 94 4.7 Bibliographical Notes... 95 References... 97 5 Moving Object Database Engines... 101 5.1 From Spatial Database Systems to MOD Engines... 102 5.1.1 SECONDO... 103 5.1.2 Hermes... 104 5.2 A Data Type Model for Trajectory Databases... 105 5.2.1 Preliminaries of Trajectory Data Types... 105 5.2.2 Trajectory-Oriented Data Types... 106 5.3 Extending the Trajectory Data Type Model with Object Methods and Operators... 109 5.3.1 Predicates and Projection Methods... 109 5.3.2 Numeric Operations... 111

Contents xiii 5.3.3 Distance Functions... 111 5.3.4 Query Operators... 112 5.4 On Mobility Data Provenance... 113 5.5 Summary... 114 5.6 Exercises... 115 5.7 Bibliographical Notes... 116 References... 117 Part III Mobility Data Exploration 6 Preparing for Mobility Data Exploration... 121 6.1 Mobility Data Warehousing... 121 6.1.1 Modeling Trajectory Data Cubes... 122 6.1.2 Performing ETL Process... 123 6.2 OLAP Analysis in Trajectory Data Cubes... 125 6.2.1 Addressing the Distinct Count Problem... 126 6.2.2 Indexing Summary Information for Efficient OLAP... 126 6.3 Calculating Similarity Between Trajectories... 128 6.3.1 Functions Computed over the Sampled Points... 128 6.3.2 Computing the Similarity Between Entire Trajectories or Sub-trajectories... 134 6.4 Summary... 137 6.5 Exercises... 138 6.6 Bibliographical Notes... 138 References... 140 7 Mobility Data Mining and Knowledge Discovery... 143 7.1 Clustering in Mobility Data... 143 7.1.1 Extending Off-the-Shelf Algorithms for Trajectory Clustering... 143 7.1.2 Sub-trajectory Clustering Methods... 147 7.1.3 Finding Representatives in a Trajectory Dataset... 150 7.2 Moving Clusters for Capturing Collective Mobility Behavior... 152 7.2.1 Flocks and Variants... 152 7.2.2 Moving Clusters... 154 7.2.3 Improvements over Flocks and Moving Clusters... 155 7.3 Sequence Pattern Mining in Mobility Data... 157 7.4 Prediction and Classification in Mobility Data... 159 7.4.1 Future Location Prediction... 159 7.4.2 Classification and Outlier Detection... 160 7.5 Summary... 163 7.6 Exercises... 164 7.7 Bibliographical Notes... 165 References... 166

xiv Contents 8 Privacy-Aware Mobility Data Exploration... 169 8.1 Privacy in Location-Based Services... 170 8.1.1 Privacy in Snapshot LBS... 171 8.1.2 Privacy in Continuous LBS... 173 8.2 Privacy Preserving Mobility Data Publishing... 175 8.2.1 Never-Walk-Alone (NWA)... 176 8.2.2 Always-Walk-with-Others (AWO)... 177 8.3 Privacy Preserving Mobility Data Querying... 178 8.4 Summary... 181 8.5 Exercises... 182 8.6 Bibliographical Notes... 183 References... 184 Part IV Advanced Topics 9 Semantic Aspects on Mobility Data... 189 9.1 From Raw to Semantic Trajectories... 190 9.2 The Semantic Enrichment Process of Raw Trajectories... 191 9.2.1 Trajectory Segmentation and Stop Discovery... 192 9.2.2 Semantic Annotation of Episodes... 194 9.3 Semantic Trajectory Data Management... 196 9.3.1 A Datatype System for Semantic Trajectories... 197 9.3.2 Indexing Semantic-Aware Trajectory Databases... 200 9.4 Semantic Trajectory Data Exploration... 201 9.4.1 Semantic-Aware Trajectory Data Warehouses... 201 9.4.2 Mining Semantic Trajectory Databases... 203 9.5 Semantic Aspects of Privacy... 204 9.5.1 LBS for Sensitive Semantic Locations... 204 9.5.2 Privacy in Semantic Trajectory Databases... 205 9.6 Summary... 206 9.7 Exercises... 206 9.8 Bibliographical Notes... 207 References... 208 10 The Case of Big Mobility Data... 211 10.1 Introduction to Big Data... 211 10.2 The MapReduce Programming Model... 213 10.2.1 Hadoop... 215 10.2.2 HadoopDB... 217 10.3 Handling Big Spatial Data... 218 10.3.1 MapReduce-Based Approaches... 219 10.3.2 A Hybrid Spatial DBMS MapReduce Approach... 221

Contents xv 10.4 Handling Big Mobility Data... 222 10.4.1 Offline Mobility Data Analytics... 222 10.4.2 Hybrid Historical Real-Time Approaches Using MapReduce... 224 10.5 Summary... 228 10.6 Exercises... 229 10.7 Bibliographical Notes... 230 References... 231 Part V Epilogue, Hands-on 11 Epilogue... 235 11.1 Bibliographical Notes... 236 References... 237 12 Hands-on with Hermes@Oracle MOD... 239 12.1 Introduction: The Hermes@Oracle Data Type System... 239 12.2 The Attiki Dataset... 241 12.3 Extracting Dataset Statistics... 243 12.4 Querying the Raw GPS Part of the Dataset... 245 12.4.1 Queries on Individual Trajectories... 245 12.4.2 Index-Supported Queries... 254 12.5 Querying the Semantically-Enriched Part of the Dataset... 258 12.6 Trajectory Warehousing and OLAP in Hermes@Oracle... 265 12.7 Progressive Explorative Analysis via Querying and Mining Operations... 267 13 Hands-on with Hermes@Postgres MOD... 279 13.1 Introduction: The Hermes@Postgres Data Type System... 279 13.2 AIS Dataset Description... 280 13.3 Loading the AIS Dataset into Hermes@Postgres... 281 13.4 Querying the AIS Dataset... 283 13.4.1 Timeslice, Range and Nearest-Neighbor Queries... 283 13.4.2 Join Queries... 288 13.4.3 Topological Queries... 290 13.4.4 Cross-Tab Queries... 294 13.5 Visualization Tips... 297 Authors Bios... 299

http://www.springer.com/978-1-4939-0391-7