AN IMPROVED TAIPEI BUS ESTIMATION-TIME-OF-ARRIVAL (ETA) MODEL BASED ON INTEGRATED ANALYSIS ON HISTORICAL AND REAL-TIME BUS POSITION

Similar documents
DS504/CS586: Big Data Analytics Data Pre-processing and Cleaning Prof. Yanhua Li

DS504/CS586: Big Data Analytics Data Pre-processing and Cleaning Prof. Yanhua Li

DS595/CS525: Urban Network Analysis --Urban Mobility Prof. Yanhua Li

Semi-Automatic Tool for Map-Matching Bus Routes on High- Resolution Navigation Networks

Constructing Popular Routes from Uncertain Trajectories

Predicting Bus Arrivals Using One Bus Away Real-Time Data

Accuracy of Matching between Probe-Vehicle and GIS Map

Genetic Algorithm for Seismic Velocity Picking

Research on Destination Prediction for Urban Taxi based on GPS Trajectory

M Thulasi 2 Student ( M. Tech-CSE), S V Engineering College for Women, (Affiliated to JNTU Anantapur) Tirupati, A.P, India

3 The standard grid. N ode(0.0001,0.0004) Longitude

Traffic Flow Prediction Based on the location of Big Data. Xijun Zhang, Zhanting Yuan

DIAL: A Distributed Adaptive-Learning Routing Method in VDTNs

Open Access Research on the Prediction Model of Material Cost Based on Data Mining

Mobile-Gateway Routing for Vehicular Networks 1

Nodes Energy Conserving Algorithms to prevent Partitioning in Wireless Sensor Networks

Optimal Design of Steel Columns with Axial Load Using Artificial Neural Networks

Exploring Campus Netflow for Managing Network

Reconstructing Uncertain Pedestrian Trajectories From Low-Sampling-Rate Observations

Semi-Automatic Tool for Bus Route Map Matching

Spatial Outlier Detection

An Edge-Based Algorithm for Spatial Query Processing in Real-Life Road Networks

A Novel Method for Activity Place Sensing Based on Behavior Pattern Mining Using Crowdsourcing Trajectory Data

Trip Reconstruction and Transportation Mode Extraction on Low Data Rate GPS Data from Mobile Phone

International Journal of Scientific Research and Modern Education (IJSRME) Impact Factor: 6.225, ISSN (Online): (

Trajectory analysis. Ivan Kukanov

T2CBS: Mining Taxi Trajectories for Customized Bus Systems

A Method of Identifying the P2P File Sharing

Comparison of integrated GPS-IMU aided by map matching and stand-alone GPS aided by map matching for urban and suburban areas

Periodic Pattern Mining Based on GPS Trajectories

CHAPTER 3 MAP DIGITIZING

Reduction Deliverable Intelligent V2V and V2I communication

Efficient Index Based Query Keyword Search in the Spatial Database

Prediction of traffic flow based on the EMD and wavelet neural network Teng Feng 1,a,Xiaohong Wang 1,b,Yunlai He 1,c

A Reduce Identical Composite Event Transmission Algorithm for Wireless Sensor Networks

Software Development of Automatic Data Collector for Bus Route Planning System

On The Method and Performance Evaluation of A Hybrid Mesh-Tree Topology

Video Inter-frame Forgery Identification Based on Optical Flow Consistency

Smart Waste Management using Internet-of-Things (IoT)

Spatiotemporal Access to Moving Objects. Hao LIU, Xu GENG 17/04/2018

SMART MAPS BASED ON GPS PATH LOGS

Restricted Nearest Feature Line with Ellipse for Face Recognition

Target Tracking in Wireless Sensor Network

Research on Stability of Power Carrier Technology in Streetlight Monitoring System

GPS/GIS Activities Summary

arxiv: v1 [stat.ml] 29 Nov 2016

Using The Heuristic Genetic Algorithm in Multi-runway Aircraft Landing Scheduling

Efficient Path Finding Method Based Evaluation Function in Large Scene Online Games and Its Application

METRO BUS TRACKING SYSTEM

Learning the Three Factors of a Non-overlapping Multi-camera Network Topology

An Improved Method of Vehicle Driving Cycle Construction: A Case Study of Beijing

An ELM-based traffic flow prediction method adapted to different data types Wang Xingchao1, a, Hu Jianming2, b, Zhang Yi3 and Wang Zhenyu4

Temporally Adaptive A* Algorithm on Time Dependent Transportation Network

Public Transportation Routing using Route Graph

Expectation and Maximization Algorithm for Estimating Parameters of a Simple Partial Erasure Model

* Hyun Suk Park. Korea Institute of Civil Engineering and Building, 283 Goyangdae-Ro Goyang-Si, Korea. Corresponding Author: Hyun Suk Park

Detect tracking behavior among trajectory data

Epipolar geometry-based ego-localization using an in-vehicle monocular camera

Research on the Checkpoint Server Selection Strategy Based on the Mobile Prediction in Autonomous Vehicular Cloud

Detecting Anomalous Trajectories and Traffic Services

Linear Quadtree Construction in Real Time *

Temperature Calculation of Pellet Rotary Kiln Based on Texture

Recommendation System for Location-based Social Network CS224W Project Report

Research on the allowable value of Lateral Breakthrough Error for Super long Tunnel from 20 to 50 kilometers

Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting

Crystal Springs Upland School Transportation Demand Management Plan. March 2016

Using Gini-index for Feature Weighting in Text Categorization

CrowdPath: A Framework for Next Generation Routing Services using Volunteered Geographic Information

Compressed Sensing Algorithm for Real-Time Doppler Ultrasound Image Reconstruction

Optimization of the Revenue of the New York City Taxi Service. using Markov Decision Processes

Fosca Giannotti et al,.

Map Matching with Inverse Reinforcement Learning

The development and application of an urban link travel time model using data derived from inductive loop detectors

Visual Traffic Jam Analysis based on Trajectory Data

AN ITERATIVE ALGORITHM FOR MATCHING TWO ROAD NETWORK DATA SETS INTRODUCTION

Congestion Analysis with Historical Travel Time Data in Seoul

A Novel Method to Estimate the Route and Travel Time with the Help of Location Based Services

On the Utilization of GSM Cell Id for Mobile Device Tracking and Notification

CHAPTER 5 ANT-FUZZY META HEURISTIC GENETIC SENSOR NETWORK SYSTEM FOR MULTI - SINK AGGREGATED DATA TRANSMISSION

ESTIMATING PARAMETERS FOR MODIFIED GREENSHIELD S MODEL AT FREEWAY SECTIONS FROM FIELD OBSERVATIONS

The Theoretical Framework of the Optimization of Public Transport Travel

Internet Engineering Task Force (IETF) Category: Informational. K. Michielsen Cisco Systems November 2011

Semantic Trajectory Mining for Location Prediction

COPYRIGHT PROTECTION OF PALETTE IMAGES BY A ROBUST LOSSLESS VISIBLE WATERMARKING TECHNIQUE *

Identifying The Stay Point Using GPS Trajectory of Taxis Hao Xiao 1,a, Wenjun Wang 2,b, Xu Zhang 3,c

A BTC-COMPRESSED DOMAIN INFORMATION HIDING METHOD BASED ON HISTOGRAM MODIFICATION AND VISUAL CRYPTOGRAPHY. Hang-Yu Fan and Zhe-Ming Lu

TM ALGORITHM TO IMPROVE PERFORMANCE OF OPTICAL BURST SWITCHING (OBS) NETWORKS

Subpixel Corner Detection Using Spatial Moment 1)

Random Traversing Based Reversible Data Hiding Technique Using PE and LSB

DELAY-CONSTRAINED MULTICAST ROUTING ALGORITHM BASED ON AVERAGE DISTANCE HEURISTIC

Fast Shortest Path Algorithm for Road Network and Implementation

Where Next? Data Mining Techniques and Challenges for Trajectory Prediction. Slides credit: Layla Pournajaf

Artificial Neural Network and Multi-Response Optimization in Reliability Measurement Approximation and Redundancy Allocation Problem

Aero-engine PID parameters Optimization based on Adaptive Genetic Algorithm. Yinling Wang, Huacong Li

Mobile 3D laser scanning technology application in the surveying of urban underground rail transit

Activity-Based Human Mobility Patterns Inferred from Mobile Phone Data: A Case Study of Singapore

ANNATTO: Adaptive Nearest Neighbor Queries in Travel Time Networks

Simulating Growth of Transportation Networks

DIGITAL ROAD PROFILE USING KINEMATIC GPS

ScienceDirect. Analytical formulation of the trip travel time distribution

Transcription:

AN IMPROVED TAIPEI BUS ESTIMATION-TIME-OF-ARRIVAL (ETA) MODEL BASED ON INTEGRATED ANALYSIS ON HISTORICAL AND REAL-TIME BUS POSITION Xue-Min Lu 1,3, Sendo Wang 2 1 Master Student, 2 Associate Professor Department of Geography, National Taiwan Normal University 3 Project Senior Technician National Center for Research on Earthquake Engineering TEL: +886-2-6630-0875, FAX:+886-2-6630-0858 E-Mail: xmlu@ncree.narl.org.tw Abstract In Taipei, current estimated time of arrival (ETA) is calculated only based on the distance and the estimated speed of the bus. In this research, the ETA model is improved by the historical and real-time bus position data. First of all, a spatial database of bus positions is established by receiving GNSS positioning data on each bus with a 25-second interval. The positions are corrected by map-matching algorithm. According to the difference of time and distance between each point, each period of bus travel time can be summarized as the basis of estimation. Using K Nearest Neighbor (K-NN) algorithm to search the most similar K records by the real-time bus position can served as the adjustment to the ETA model. Finally, the result shows that the proposed K-NN model can provide good accuracy, which the best mean absolute error is 140 seconds ;and the best mean absolute percentage error is 7.44%. Keywords: Estimated Time of Arrival(ETA), Global Navigation Satellite System(GNSS), Bus, Road Network, Real-time Traffic INTRODUCTION With the rapid development of Taipei city, the demand for public transport is obviously on the rise. In order to make the city s transport network operate more efficiently, a reliable Intelligent Transportation System(ITS) is therefore necessary. Nowadays, nearly every bus in Taipei city has already been equipped with the Global Navigation Satellite System (GNSS) receiver and most of the bus stops have LED displayers installed accordingly to show the Estimated Time of Arrival (ETA). However, the calculation of the current ETA is solely based on the distance and the estimated speed of the bus, which is not sufficiently accurate and turns out to be a possible factor that leads to the unwillingness to take a bus. Thus, to solve the problem, this research is aiming at the historical and real-time bus position data as an alternative. Before modeling the historical based ETA, some pre-processing is required to construct the historical database. First, we do map-matching algorithm to correct the GNSS data to match respective bus segments. M.A Quddus et al. (2007) discussed the limitations of a number of map-matching algorithms, including geometry, topology, probability theory, and other advanced map matching methods. Lou et al. (2009) took spatial and time factors into consideration, subsequently proposing the ST-algorithm to improve map-matching abilities. Strengthening on the ST algorithm, Hsueh et al. added the direction factor and formulated the STD-algorithm. The algorithms mentioned above are based on complex urban cases. In other words, They rely on the accuracy of matching a specific position in a complex urban spatial network. Differing from the research above, in our study, we duse routes data, which are provided by Taipei city Public Transportation Office, so that we can concentrate uniquely on temporal and directional factors. After building the historical database, we compare real-time data with historical data via K-nearest Neighbor (K-NN) algorithm. Liu et al. (2012) compared the K-NN with Artificial neural network (ANN) algorithm, finding out that K-NN produces better results. Chia-Ju Ho(2009) used the K-NN to test the K value, time window, and distance threshold setting with the buses in Taipei. He also found out that the square of the distance difference can make a better result. Yeh ( 2017) interpolated the historical data and re-adjust thresholds in different time period. In this paper, we study buses in Taipei city,excluding the inter-city buses. Our GNSS position data are provided by Public Transport Data Exchange(PTX) which belongs to the Ministry of Transportation and Communications (MOTC). 346

The transmission delay of GNSS data s return period is around 20-25 seconds. Finally, because of the timetable adjustment in April, the period of the route data are restricted between January 5th to the end of March. METHODOLOGY map-matching algorithm The fields of GNSS data provided by PTX consist of plate number, route name, direction, azimuth, duty status, latitude, longitude, route unique identity and GNSS update time. Every location coordinate received from GNSS needs to match with a route line. Each route s direction attribute includes outbound and inbound trajectories which are distinguished by the codes 0 and 1. First of all, in order to simplify the operation, we convert each route into trajectory points whose interval is 3 meters. Secondly, we correct the GNSS points with the shortest distance, if a selected GNSS point has the same direction attribute with a route point. However, considering only the shortest distance between GNSS points and route trajectory points usually ends up producing fault matchings, especially when two buses pass the same stop in different directions. Therefore, taking azimuth attribute into account and introducing a weight in the calculation of distance may improve the outcome. As mentioned above, a weighted matching point is calculated with the following equation: (1) (2) (3) Here is the weight of a route point j for the calculated distance ; is the difference between a GNSS point and the azimuth of route points t to t+1. If the difference is bigger than 180, then ; represents on a route named k the distance between a selected GNSS point and a route point ; represents the corrected point. Figure 1 shows an example of matching results. The rectangle in figure 1. is the nearest point between GNSS position and route point. Figure 2. shows the result we took azimuth into account. Figure1. the result of map matching by only considering shortest distance. 347

Figure2. the result of map matching by taking shortest distance and azimuth into account. Interpolate time and distance data After running the map-matching algorithm, the next step is to build a database. In order to make time and position data searchable, it is necessary to interpolate data into every route points. First, we calculate the average velocity between every two consecutive GNSS positions. Then we use the time and the distance from the origin point to interpolate time into every route point. As a result, for any time range at a route point, we can estimate the moments when the buses pass by in the range. The interpolated results are shown in the table 1 below: Table 1. interpolated data K-nearest Neighbor (K-NN) algorithm The K-NN algorithm searches for the K records which share the most similar features from the historical database and then estimate the bus arrive time by analyzing these records. The idea is, primarily, to compare the input feature vector which we want to estimate with historical data in a time window of fixed length. The similarity is measured by distance metric. In our case study, we adopt the Euclidean distance. Finally, the K-NN algorithm averages the most similar K records to estimate bus arrive time. The K-NN method is done as follows: Step1. Select a GNSS position and retrieve the last n-1 records of GNSS positions before, which we denote by a vector [,,, ]. n is a number which defines length of a vector. Step2. Calculate the time difference of the vector [,,, ]. The result is denoted [,,, ]. 348

Step3. Find vectors from the same columns as [,,,, ] from the database if a time [ time window.,, ] and calculate their time difference is between - T to + T. Here, T is the length of Step4. Calculate the residual of each vector vectors. Figure 3 shows the Step 1 to 4: by distance metric, and pick the k smallest Figure3. K-NN algorithm steps from 1 to 4 Step5. According to values of r, get the travel time from the starting position to target position from the database, and average all the values to obtain the estimated travel time. There are many parameters that can influence the estimations of the K-NN method. In this study, we discuss various K values and various lengthes of time window T. EXPERIMENT This research took buses in Taipei city as an example, except for highway buses. In order to verify the accuracy of this method, we have picked 11 bus routes. The routes that we picked are 0South, 1, 20, 74, 218, 220, 270, 284, 588, 600, 611,and 949. Figure 4 shows the distribution of those routes. The Database features the GNSS data dated from January 5, 2018 to March 9, 2018. We verify the result with the real-time data from March 20, 2018 to March 31, 2018. Figure4. Map of selsected routes 349

It is important for k-nn method to set parameters. In this experiment, The K value is tested for 5, 10, and 15; The length of time window is tested for 300, 600, and 900 seconds. Besides, this research also focuses on the difference between peak hours and off-peak periods. Hence, the periods between 8:00 to 9:00 and 17:00 to 18:00 are regarded as peak hours. On the contrary, the periods in 10:00 to 11:00 and 13:00 to 14:00 are regarded as off-peak ones. The study used the Mean Absolute Error(MAE) and Mean Absolute Percentage Error (MAPE) to evaluate the accuracy of our estimate. The formulas of MAE and MAPE are as follows: (4) (5) Here, is the forecasted value, is the observed value, and N is the number of samples. MAE shows the average of the absolute differences between the forecast value and the observe value, MAPE shows the relative difference of the observed value and forecasted value. RESULT Our experiment tested a number of parameters for MPE and MAPE analysis. The K-value by 5, 10, and 15 ;the length of time window T tested by 300, 600, and and 900 seconds. Starting and Ending points are randomly selected by computer. As the K-value increases, the number of Valid samples Less than or equal to origin ones. That is because some data are not long enough before a searching point. Table 2 shows the results of MAE and MAPE: Table 2. (a) the result of MAE; (b) the result of MAPE.(a) 5 10 15 300 164s(N=339) 140s(N=268) 154s(N=237) 600 163s(N=321) 141s(N=262) 159s(N=231) 900 189s(N=299) 158s(N=261) 159s(N=229) (b) 5 10 15 300 8.46% 7.44% 7.57% 600 8.39% 7.49% 7.79% 900 9.25% 7.84% 7.80% Where N is the sample sizes. Accroding to table 3, the optimal estimation time can be found, when K equals 10 and T equals 600 seconds. Conversely when K equals 5 and T equals 900 seconds, the worst one was found. The difference between the best and the worst MAE is up to 49 seconds. In addiction, when the K value is 5, the T value has a greater variation;when the K value is 5 or 10, the T value has a less variation. Finally, using parameters whose K equals 10 and T equals 600 seconds to test in difference of 4 periods. Table 3 shows the MAE and MAPE for the period from 8:00 to 9:00, 10:00 to 11:00, 13:00 to 14:00, and 17:00 to 18:00. The result shows that the MAE that we selected to represent the peak hour from 8:00 to 9:00 is up to 213 seconds, but the MAE from 13:00 to 14:00 is only 92 seconds. In other word, there is a significant difference between the peak hour and the off-peak period. This shows that this method should be improved if it is applied at different times. 350

Table 3. MAE and MAPE in peak hours and off-peak periods. Sample sizes MAE MAPE 8:00-9:00 24 213s 11.12% 10:00-11:00 14 115s 6.17% 13:00-14:00 34 92s 4.89% 17:00-18:00 21 187s 9.88% In this case study,k-nn algorithm search similar historical data vectors in a range of time. It is effective and simple to estimate bus arrive time. And the estimated error can be about two minutes. Basing on GNSS position data as estimated resource, the accuracy of map-matchin algorithm affect estimation result. In order to get better one, except for imporoving the map-matching algorithm, there are more factors should be considered. For instance, driving on a bus lane or not, driving on holidays or weekdays, and so on. CONCLUSION This paper used K-NN algorithm to build a model for bus ETA. First of all, GNSS position data needed to be preprocessed by a map-matching method. The proposed method considered not only the shortest distance but also the azimuzh attribute to correct GNSS position to a bus route points. Secondly the time data are calculated and interpolated so as to make data searchable by each bus route points. Finally, the K-NN algorithm is applied to search historical database and the average of the most similar K records is taken as the ETA. By comparing K-value and time window T, in our case study, the result showed that when the K value is 10 and T is 600 seconds, the MAE attains the minimum 140 seconds and the MAPE is 7.44%. In addition, the study also revealed the differences between the results when the method is applied to the peak and the off-peak periods. REFERENCES Chia-Ju Ho (2010), The Study of Urban Bus Arrival Time Prediction Using the k-nn Model, Master's Thesis of Department of Department of Transportation Technology and Management. Hsinchu: National Chiao Tung University, 46 pp. Mohammed A. Quddus, Washington Y. Ochieng, Robert B. Noland (2007), Current map-matching algorithms for transport applications: State-of-the art and future research directions, Transportation Research Part C: Emerging Technologies, 15(5):312-328 T. Liu, J. Ma, W. Guan, Y. Song and H. Niu (2012), "Bus Arrival Time Prediction Based on the k-nearest Neighbor Method," 2012 Fifth International Joint Conference on Computational Sciences and Optimization, Harbin, pp. 480-483. Wedin, D. (2015). Travel Time Estimation in Stockholm Using Historical GPS Data (Dissertation). Master's Thesis of Department of Information Technology.Uppsala, Uppsala University, 51pp. Retrieved from http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva- 260692 Yeh, Cheng-Tsung (2017), The Study of Estimating Bus Arrival Time by Using Public Transit Real Time Data, Master's Thesis of Department of Department of Transportation Technology and Management. Hsinchu: National Chiao Tung University, 47 pp. Yin Lou, Chengyang Zhang, Yu Zheng, Xing Xie, Wei Wang, Yan Huang(2009), Map-matching for low-sampling-rate GPS trajectories, Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, November 04-06, 2009, Seattle, Washington. Yu-Ling Hsueh, Ho-Chian Chen(2017), Map matching for low-sampling-rate GPS trajectories by exploring real-time moving directions, Information Sciences, Volumes 433 434, 2018, Pages 55-69, ISSN 0020-0255, https://doi.org/10.1016/j.ins.2017.12.031. 351