An ELM-based traffic flow prediction method adapted to different data types Wang Xingchao1, a, Hu Jianming2, b, Zhang Yi3 and Wang Zhenyu4

Similar documents
Prediction of traffic flow based on the EMD and wavelet neural network Teng Feng 1,a,Xiaohong Wang 1,b,Yunlai He 1,c

5th International Conference on Information Engineering for Mechanics and Materials (ICIMM 2015)

Nelder-Mead Enhanced Extreme Learning Machine

Extreme Learning Machine: A New Learning Scheme of Feedforward Neural Networks

Regularized Committee of Extreme Learning Machine for Regression Problems

A Data Classification Algorithm of Internet of Things Based on Neural Network

Accuracy of Matching between Probe-Vehicle and GIS Map

Feature selection in environmental data mining combining Simulated Annealing and Extreme Learning Machine

Solving Large Regression Problems using an Ensemble of GPU-accelerated ELMs

Effect of the PSO Topologies on the Performance of the PSO-ELM

Power Load Forecasting Based on ABC-SA Neural Network Model

Analysis of Randomized Performance of Bias Parameters and Activation Function of Extreme Learning Machine

Application of Geometry Rectification to Deformed Characters Recognition Liqun Wang1, a * and Honghui Fan2

Real-Time Image Classification Using LBP and Ensembles of ELM

The Study of Intelligent Scheduling Algorithm in the Vehicle ECU based on CAN Bus

Efficient Degree Elevation and Knot Insertion for B-spline Curves using Derivatives

Image Classification using Fast Learning Convolutional Neural Networks

Warship Power System Survivability Evaluation Based on Complex Network Theory Huiying He1,a, Hongjiang Li1, Shaochang Chen1 and Hao Xiong1,2,b

The Faults Diagnostic Analysis for Analog Circuit Faults Based on Firefly Algorithm and Extreme Learning Machine

arxiv: v1 [cs.lg] 25 Jan 2018

Traffic Flow Prediction Based on the location of Big Data. Xijun Zhang, Zhanting Yuan

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK

Package elmnnrcpp. July 21, 2018

An Algorithm of Parking Planning for Smart Parking System

An Improved Method of Vehicle Driving Cycle Construction: A Case Study of Beijing

Large Scale Wireless Indoor Localization by Clustering and Extreme Learning Machine

Research on friction parameter identification under the influence of vibration and collision

Research on the Application of Digital Images Based on the Computer Graphics. Jing Li 1, Bin Hu 2

FUZZY C-MEANS ALGORITHM BASED ON PRETREATMENT OF SIMILARITY RELATIONTP

Improvement of SURF Feature Image Registration Algorithm Based on Cluster Analysis

A methodology for Building Regression Models using Extreme Learning Machine: OP-ELM

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

Extending reservoir computing with random static projections: a hybrid between extreme learning and RC

Data Discretization Using the Extreme Learning Machine Neural Network

An Efficient Optimization Method for Extreme Learning Machine Using Artificial Bee Colony

Fully complex extreme learning machine

ANN-Based Modeling for Load and Main Steam Pressure Characteristics of a 600MW Supercritical Power Generating Unit

Research on time optimal trajectory planning of 7-DOF manipulator based on genetic algorithm

A faster model selection criterion for OP-ELM and OP-KNN: Hannan-Quinn criterion

A Robust Image Zero-Watermarking Algorithm Based on DWT and PCA

An Improved KNN Classification Algorithm based on Sampling

FSRM Feedback Algorithm based on Learning Theory

Extreme Learning Machines. Tony Oakden ANU AI Masters Project (early Presentation) 4/8/2014

Fast Learning for Big Data Using Dynamic Function

Support Vector Regression for Software Reliability Growth Modeling and Prediction

A New Fuzzy Neural System with Applications

Adaptive osculatory rational interpolation for image processing

Open Access Research on the Prediction Model of Material Cost Based on Data Mining

Research Article An Improved Kernel Based Extreme Learning Machine for Robot Execution Failures

Parameter optimization model in electrical discharge machining process *

CLASSIFICATION APPROACH FOR BIG DATA DRIVEN TRAFFIC FLOW PREDICTION USING APACHE SPARK

Hotel Recommendation Based on Hybrid Model

Road Network Traffic Congestion Evaluation Simulation Model based on Complex Network Chao Luo

Fault Diagnosis of Power Transformers using Kernel based Extreme Learning Machine with Particle Swarm Optimization

K-coverage prediction optimization for non-uniform motion objects in wireless video sensor networks

A Boosting-Based Framework for Self-Similar and Non-linear Internet Traffic Prediction

High Resolution Remote Sensing Image Classification based on SVM and FCM Qin LI a, Wenxing BAO b, Xing LI c, Bin LI d

THE discrete multi-valued neuron was presented by N.

Design and Research of Adaptive Filter Based on LabVIEW

Research on the Checkpoint Server Selection Strategy Based on the Mobile Prediction in Autonomous Vehicular Cloud

manufacturing process.

3 The standard grid. N ode(0.0001,0.0004) Longitude

Prior Knowledge Input Method In Device Modeling

Hybrid ant colony optimization algorithm for two echelon vehicle routing problem

Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting

AN IMPROVED TAIPEI BUS ESTIMATION-TIME-OF-ARRIVAL (ETA) MODEL BASED ON INTEGRATED ANALYSIS ON HISTORICAL AND REAL-TIME BUS POSITION

Research on Evaluation Method of Product Style Semantics Based on Neural Network

Research on Heterogeneous Communication Network for Power Distribution Automation

UAV Motion-Blurred Image Restoration Using Improved Continuous Hopfield Network Image Restoration Algorithm

Models and Algorithms for Shortest Paths in a Time Dependent Network

An Efficient Learning Scheme for Extreme Learning Machine and Its Application

Impact of Variances of Random Weights and Biases on Extreme Learning Machine

Trajectory Planning for Reentry Maneuverable Ballistic Missiles

Citation for the original published paper (version of record):

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article

Time Series Clustering Ensemble Algorithm Based on Locality Preserving Projection

Object Tracking Algorithm based on Combination of Edge and Color Information

An adaptive container code character segmentation algorithm Yajie Zhu1, a, Chenglong Liang2, b

Utilizing Restricted Direction Strategy and Binary Heap Technology to Optimize Dijkstra Algorithm in WebGIS

Learning the Three Factors of a Non-overlapping Multi-camera Network Topology

CHAOTIC ANT SYSTEM OPTIMIZATION FOR PATH PLANNING OF THE MOBILE ROBOTS

Research on Algorithm Schema of Parametric Architecture Design Based on Schema Theory

Neurocomputing 102 (2013) Contents lists available at SciVerse ScienceDirect. Neurocomputing. journal homepage:

Developing an intelligent sign inventory using image processing

High Capacity Reversible Watermarking Scheme for 2D Vector Maps

FAST HASH FUNCTION BASED ON BCCM ENCRYPTION ALGORITHM FOR E-COMMERCE (HFBCCM)

Automatically Balancing Intersection Volumes in A Highway Network

Shared-network scheme of SMV and GOOSE in smart substation

Inversion of Array Induction Logs and Its Application

Landslide Monitoring Point Optimization. Deployment Based on Fuzzy Cluster Analysis.

Computer Aided Drafting, Design and Manufacturing Volume 26, Number 2, June 2016, Page 18

Research on the Algorithms of Lane Recognition based on Machine Vision

Research on the New Image De-Noising Methodology Based on Neural Network and HMM-Hidden Markov Models

Adaptive replica consistency policy for Kafka

A Rapid Automatic Image Registration Method Based on Improved SIFT

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

TensorFlow and Keras-based Convolutional Neural Network in CAT Image Recognition Ang LI 1,*, Yi-xiang LI 2 and Xue-hui LI 3

A Kind of Wireless Sensor Network Coverage Optimization Algorithm Based on Genetic PSO

A rapid peak detection algorithm

Transcription:

6th International Conference on Information Engineering for Mechanics and Materials (ICIMM 206) An ELM-based traffic flow prediction method adapted to different data types Wang Xingchao, a, Hu Jianming2, b, Zhang Yi3 and Wang Zhenyu4 Department of Control Science and Engineering, Tongji University, Shanghai, 20804, China 2, 3, 4 Department of Automation, Tsinghua University, Beijing, 00084, China a email: 352805@tongji.edu.cn, bemail: hujm@mail.tsinghua.edu.cn Keywords: ELM; non-uniform data; traffic flow prediction Abstract. Traffic flow prediction has been the crucial aspect in the Intelligent Transport Systems (ITS) and it will help ITS to induce the traffic flow rationally. In this paper, a prediction model based on ELM has been discussed through testing this model with actual data. This model takes advantage of the speed and ability of handling nonlinear system of the ELM algorithm. By analyzing the actual data in Haining County in 206, the feasibility and advantages of ELM prediction model have been shown when compared with other algorithms. Analyzing the non-uniform and lost data, this model shows the performance that the model can be used in actual application. These characters can remedy the shortcomings which other models have. Introduction With the development of society, the need of Intelligent Transport Systems (ITS) has been stronger in case of the traffic jam. Moreover, in the small-medium cities this problem has been more serious because of the increasing vehicles and narrow road. Short term travel flow prediction based on the time series takes an important role in the ITS to ease the road. Traditional prediction models include: Kalman filter, auto-regressive integrated moving average model (ARIMA), neural networks, support vector regression (SVR). Besides, neural networks has been another major tool to predict the travel flow. However, these current prediction models are imperfect to predict. Although neural networks can predict accurately, but the time used by the processing is much longer than other models. The ARIMA is fast but not accurate. Therefore the research wants to take advance of the speed and accuracy of Extreme Learning Machine (ELM) in order to predict the travel flow better. What s more, most forecasting models cannot handling the non-uniform sampling data and missing data which may be attained in the actual society. If the prediction models always use the method based on uniform sampling data to handling these different type data, the prediction results may be not accurate. Thus, the research also test this prediction model based on the different datasets. For further accuracy and rapidity travel flow prediction, this forecasting model based on ELM has been used to predict the travel flow in this paper. This experiment used ARIMA and wavelet neural network (WNN) to be the comparison to test the performances of this model. Besides, the research also uses the non-uniform and missing data to prove that this model can satisfy the need of actual application. Extreme Learning Machine The ELM algorithm (proposed by Huang) is a fast learning method. This method is a kind of Single-Layer Feedforward Neural Networks (SLFNs). The activation function in the hidden layer can be infinitely differentiable so the weight of input can be randomly settled. Moreover, the least squares solutions can be obtained by minimizing quadratic loss function. As a result, the iteration is not required when the network parameters are under randomly determining, which benefits the reducing of the amount of calculation and the time of forecasting. The network structure of ELM 206. The authors - Published by Atlantis Press 407

shows in the following figure. Fig.. ELM algorithm diagram Theorem2.. For N arbitrary distinct samples (x i,t i), x i =[x i,x i2,., x in ] T R n is the input vector(n is the number of input node), and t i =[t i,t i2,., t im ] T R m is the output vector. G(x) is the activation function. The model can be described as the following function. m m bigi( xj) = big( ωi xj + bj) = tj, () i= i= j=,2,,n, Where, ω i=[ω i, ω i2,,ω in ] T is the weight vector to connect the i th hidden node and the input nodes, b i is the threshold of the i th hidden node. β i=[β i,β i2,,β im ] T is the weight vector, which connects the i th hidden node and the output nodes. The N equations can be simplified as the following equation. Hβ = T, (2) g( w x+ b) g( wm x+ bm) H =, (3) g( w xn + b) g( wm xn + bm) T β= [ β, β,, β ], (4) i i2 im k m T T t t2 tm k N =[,,, ], (5) Where, H is the hidden layer output matrix; β is the weight vector of output layer; T is the desired output and k is the output node number. The unique smallest norm least squares solution of the equation () is: β = H + T, (6) H + is the Moore-Penrose generalized inverse of output matrix H. Steps of ELM algorithm: () Randomly assign input weights and hidden layer threshold; (2) Calculate the hidden layer output vector H; (3) Calculate the output weights vector β=h + T. Data type In this section, three types data which have been used in this paper will be introduced. Most travel prediction models base on the data sampled uniform, but actual data are sampled with some non-uniform even lost. Thus, this paper uses 3 kinds of data to test the model: uniform sampling data; non-uniform sampling data; missing data. Each data type has the unique parameter which will influence the dataset. However, most test data are uniform sampled and the non-uniform sampling data or missing data cannot be obtained. Thus, in this paper the research adopts the randomly merge method to obtain the non-uniform data and the randomly lost method to attain the missing data. When handling different data, this research uses different ways to predict. 408

Table Data type Data Parameter Signal Uniform sampling data Merge number Merge_Num Non-uniform sampling Max merge-number Merge_Max_Num data Missing data Missing rate Lost_Rate Prediction algorithm based on ELM To predict based on different type data, the algorithm differs when handling the input dataset. For non-uniform sampling data the method is the same as the uniform data but the sequences number is fewer. Handling missing data, this research uses the abandon forecasting method which means that the algorithm will abandon the days data when meeting the lost data. The structure of this prediction algorithm shows in the flowing diagram Raw data Data handling (merge or lost) Uniform and non-uniform Lost data: abandon prediction Experimental data ELM training ELM forecasting Fig. 2. Steps of prediction When the research wants to forecast the j+ th sequence data, we can get the j th traffic flow data. It is that there are N days data and one day has T sequences. Therefore, the data vector is N*T and the i th observation point s j th traffic flow data is x ij. The training input of ELM is: xn p+, j+ 2 p2 x N p+, j+, xn, j+ 2 p2 x N, j+ p p2 (7) The training output is: x N p+, j+, x N, j+ p (8) The testing input is: xn, j+ 2 p2 x N, j (9), p2 Attain forecasting result Raw Data The original data are used in this paper is collected in Haining County, Jiaxing City, Zhejiang Province, China by microwave in 206. The raw data uniform sampled have been detected all days and the detection interval is minute. The dataset covers 7 observation points and 20 days with one day has 440 sequences. In this case, the first 9 days data are training data and the 20th day data are testing data. In the following figure, the locations of observation points are showed. 409

Fig. 3. Distribution of observation points When the research plot the data and several properties can be found: little travel flow volume; strong fluctuation; no obvious crest. So the research merges and cuts out the raw data to find out the experimental data whose detection interval is 5 minutes. Fig. 4. Raw data The forecasting process is programmed in Matlab 204a environment running in an Intel Core i5-3230m, 2.6GHZ CPU. Running the program with the history data and testing data will get the forecasting performance. Prediction results Uniform data. In this section, the experiment choose the 5th observation point to be the test point. Based on the uniform sampling data and comparison with ARIMA and WNN, the performances of this model will be shown in table and figures. Table 2 Time and accuracy comparison Model Time Error ELM 0.375s 7.66% ARIMA 0.570s 3.23% WNN 34.63s 0.43% In the table 2, the speed can be found faster and more accurate than other two models. The forecasting are showed in the following figures. The first one is the ELM forecasting results. 40

Fig. 5. Prediction comparison figure Non-uniform data. In this paper, the max merge number is 2 which means the adjacent data have 50% probability to combine. By using this method, we can get the traffic volume which is non-uniform sampled to do experiment. And the prediction results show in the following picture: Fig. 6. Non-uniform sampling data prediction From the figure 5, the result shows that the error rate of this method is 8.75%. Although this result is not good when compare with the uniform result. But if we analyze the data, the extremely strong fluctuation is the important influence to predict. And in the actual environment the random non-uniform sampling data hardly exist. Therefore, this method can also apply to predict based on non-uniform data. Missing data. This section will test whether this model can predict based on the data with lost data. The 5 th observation point s data are used in this test. By setting the lost rate to get the sequences with missing data. This research sets the lost rate is 0%., which means every datum has 0% probability to lose. Following picture shows the prediction result. Fig. 7. Missing sampling prediction 4

In this picture, the prediction tendency can follow the real tendency well. By calculating, the error rate is.74%. In case of the using of abandon prediction method, the strong fluctuation cannot influence the performances of forecasting. Moreover, this model can predict accurately in this environment that some data lost. Therefore, this method will help the ITS to predict traffic flow when the actual data miss. Conclusion This paper concludes prediction methods which have been used and offer the obvious shortcomings in these methods. Besides, this paper discusses the time series prediction method based on ELM. By handling raw data, the training dataset and testing input will be determined. And because of that the weight of hidden layer can be randomly determined and the hidden layer is single, the ELM algorithm is faster than the normal neural networks. Moreover, the ability to adapt to the nonlinear system make the model can predict better. By analyzing the actual data in Haining County in 206, the performances can be showed in tables and figures when it compared with other algorithms. Moreover, when testing the model with non-uniform and missing data, this method can satisfy the need of the actual application. Meanwhile, this model can provide better support for traffic guidance. Acknowledgement This work is supported by the national science and technology support program (204BAG03B0), National Natural Science Foundation China 6673232, 6273238, and Tsinghua University Program (203089307). References [] Guang-Bin Huang, Qin-Yu Zhu, Chee-kheong Siew, Extreme learning machine: Theory and applications, Neurocomputing 70 (2006) 489-50. [2] Guang-Bin Huang, What are Extreme Learning Machines? Filling the Gap Between Frank Rosenblatt s Dream and John von Neumann s Puzzles, 205.04.09, Cogn Compute (205) 7:263-278. [3] Lun Li, Dong Wang, Zhu Xiao, and Xiaohong Li, Urban Road Travel Time Prediction based on ELM, 5th International Conference on Advanced Materials and Computer Science (ICAMCS 206)(206). [4] Oh S, Byon Y J, Jang K, et al., Short-term Travel-time Prediction on Highway: A Review of the Data-driven Approach, Transport Reviews A Transnational Transdisciplinary Journal, vol.35, no., pp.4-32 (205). [5] Tianfan Ye, A pruning algorithm with L/2 regularizer for extreme learning machine, Journal of Zhejiang University Science C(Computers & Electronics)(204). [6] Jianding Xiao, Active set strategy of optimized extreme learning machine, Chinese Science Bulletin (204). [7] Baojian Li, Monthly discharge forecasting using wavelet neural networks with extreme learning machine, Science China Technological Sciences (204). 42