Thesis Open Access
HABTU REDA
Estimating public bus arrival times and delivering accurate arrival time information to
passengers are critical for making public transportation more user-friendly and thereby
increasing its competitiveness among various forms of transportation. However public bus
arrival time prediction remains major bottlenecks With traffic heterogeneity in composition and
diversity of vehicles, as well as a big pedestrian population combined with inadequate lane use,
predicting the arrival time of public buses at stations is a severe concern.. The main objective of
this study is to apply machine learning algorithms to predict bus arrival time. The data was
collected from Addis Ababa Sheger Public Bus Transport. Random Forest, Gradient Boosting,
Artificial Neural Network, K-Nearest Neighbors and Support Vector Machine algorithms are
applied to build the models and to compare and choose the best model to predict the bus arrival
time. After selecting the features and algorithms, different data preprocessing tasks like checking
outliers, missing values and data reduction are done. Finally, 140,000 instances of dataset are
used to train and build the model. The prepared dataset is partitioned into 90% training and 10%
testing set. Beginning Date, Beginning Time, End Date, Time Range, Mileage, Duration, Initial
latitude, Initial longitude, Final latitude, Final longitude, and End Time were used as input
features for developing the model. Based on the experiment result the Random Forest algorithm
achieved a better performance with R-squared score of 0.994, MAE of 0.812, RMSE of 3.780
and MSE of 14.28.
Name | Size | |
---|---|---|
f1042664640.pdf
md5:d45cbd5fd89e1d88df48cadeabb670f7 |
647.9 kB | Download |
All versions | This version | |
---|---|---|
Views | 0 | 0 |
Downloads | 0 | 0 |
Data volume | 0 Bytes | 0 Bytes |
Unique views | 0 | 0 |
Unique downloads | 0 | 0 |