Predicting Bus Passenger Flow and Prioritizing Influential Factors Using Multi-Source Data
Predicting Bus Passenger Flow and Prioritizing Influential Factors Using Multi-Source Data
ROLL NO : 110520504025.
GROUP : MSC(COMPUTER SCIENCE) 2ND YEAR.
COLLEGE : JAGRUTHI DEGREE & PG COLLEGE.
EXISTING
SYSTEM
In recent years, the advent and prevalence of deep learning models have provoked
a storm in the field of transportation. There are also a handful of studies on the
passenger flow prediction using deep learning models. Liu and Chen [20]
developed a multi-stage deep learning architecture to forecast the passenger flow
for bus rapid transit stations.
To defeat the drawbacks of single models and take advantage of different models,
an increasing number of researchers have developed hybrid models by integrating
different single models. their method integrates empirical mode decomposition and
ANN. Ma et al. (2014) [28] presented an integrating approach with interactive
multi-model pattern in the short-term passenger demand forecasting.
Disadvantages
• In the existing work, the system did not implement novel scaled
stacking gradient boosting decision tree (SS-GBDT) model.
• This system is less performance due to lack of Implicit linkage
between features and predicted labels.
PROPOSED
SYSTEM
The system proposes a novel scaled stacking gradient boosting decision tree (SS-
GBDT) model to predict bus passenger flow with multi-source datasets. SS-GBDT
includes two modules: the prior feature-generation module and the subsequent
GBDT-prediction module. The prior module entails a couple of basic models with
similar performance, which generates several enhanced features of multi-source
data by stacking process.
Results show that SS-GBDT not only presents superiority in both prediction
accuracy and stability, but can also better handle the multicollinearity issue with
multisource data. It can also prioritize the influential factors on passenger flow
prediction. The prediction model is flexible and scalable, which enables the
integration of various influential factors in the presence of big data.
Advantages
• The system is more effective since it presents Scaled Stacking Process for
Multi-Source Data.
• The system is accurate since it is implemented novel scaled stacking gradient
boosting decision tree (SS-GBDT) model.
SYSTEM
REQUIREMENTS
SOFTWARE REQUIREMENTS:
• Operating system : Windows 7 Ultimate.
• Front-End : Python.
• Back-End : Django-ORM