Predicting Bus Passenger Flow and Prioritizing Influential Factors Using Multi-Source Data

This document describes a student named Vineeth Kumar who is proposing a novel scaled stacking gradient boosting decision tree (SS-GBDT) model to accurately predict bus passenger flow using multi-source data. The proposed SS-GBDT model includes a prior feature generation module that generates enhanced features from multiple data sources using a scaled stacking method, and a subsequent GBDT prediction module. The model aims to better handle data correlations and prioritize influential prediction factors compared to existing approaches.

Uploaded by

SLDFLAG

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views

Predicting Bus Passenger Flow and Prioritizing Influential Factors Using Multi-Source Data

Uploaded by

SLDFLAG

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

NAME : VINEETH KUMAR.

ROLL NO : 110520504025.
GROUP : MSC(COMPUTER SCIENCE) 2ND YEAR.
COLLEGE : JAGRUTHI DEGREE & PG COLLEGE.

Predicting Bus Passenger Flow and Prioritizing

Influential Factors Using Multi-Source Data Scaled
Stacking Gradient Boosting Decision Trees
ABSTRACT
Accurate bus passenger flow prediction contributes to informed decisions and full
utilization of transit supply. Passenger flow is affected by an extensive range of
attributes featuring travel environment, which can be collected through multi-
source information. A successful prediction model should not only fully utilize the
latent knowledge hidden in multisource data, but also address the resulting
multicollinearity issue. Based on this principle, we propose a novel scaled stacking
gradient boosting decision tree (SS-GBDT) model to predict bus passenger flow
with multi-source datasets.
SS-GBDT includes two modules:
• The prior feature-generation module and
• The subsequent GBDT-prediction module.
The prior module entails a couple of basic models with similar performance,
which generates several enhanced features of multi-source data by stacking
process. Particularly, we devise a scaled stacking method by introducing a quasi-
attention based mechanism. It can also prioritize the influential factors on
passenger flow prediction. The prediction model is flexible and scalable, which
enables the integration of various influential factors in the presence of big data.

EXISTING
SYSTEM

In contrast to the parametric approaches, the principle of the non-parametric

approaches is to build a nonlinear relationship between the input variables and the
output variables without prior knowledge. Artificial neural network (ANN) models
can handle the complex relationships in datasets and have gained wide popularity
in transportation. However, the drawback of ANN is the potential occurrence of
over-fitting or under-fitting. As another non-parametric models, support vector
machine (SVM) and support vector regression (SVR) models can potentially
overcome the drawbacks of neural networks and address the issues of nonlinearity,
small samples, high dimension, local minima and over-fitting. Markovi´c et al.

In recent years, the advent and prevalence of deep learning models have provoked
a storm in the field of transportation. There are also a handful of studies on the
passenger flow prediction using deep learning models. Liu and Chen [20]
developed a multi-stage deep learning architecture to forecast the passenger flow
for bus rapid transit stations.

To defeat the drawbacks of single models and take advantage of different models,
an increasing number of researchers have developed hybrid models by integrating
different single models. their method integrates empirical mode decomposition and
ANN. Ma et al. (2014) [28] presented an integrating approach with interactive
multi-model pattern in the short-term passenger demand forecasting.
Disadvantages
• In the existing work, the system did not implement novel scaled
stacking gradient boosting decision tree (SS-GBDT) model.
• This system is less performance due to lack of Implicit linkage
between features and predicted labels.

PROPOSED
SYSTEM

The system proposes a novel scaled stacking gradient boosting decision tree (SS-
GBDT) model to predict bus passenger flow with multi-source datasets. SS-GBDT
includes two modules: the prior feature-generation module and the subsequent
GBDT-prediction module. The prior module entails a couple of basic models with
similar performance, which generates several enhanced features of multi-source
data by stacking process.

Results show that SS-GBDT not only presents superiority in both prediction
accuracy and stability, but can also better handle the multicollinearity issue with
multisource data. It can also prioritize the influential factors on passenger flow
prediction. The prediction model is flexible and scalable, which enables the
integration of various influential factors in the presence of big data.

Advantages
• The system is more effective since it presents Scaled Stacking Process for
Multi-Source Data.
• The system is accurate since it is implemented novel scaled stacking gradient
boosting decision tree (SS-GBDT) model.

SYSTEM
REQUIREMENTS

➢ H/W System Configuration:-

➢ Processor - Pentium –IV

➢ RAM - 4 GB (min)
➢ Hard Disk - 20 GB
➢ Key Board - Standard Windows Keyboard
➢ Mouse - Two or Three Button Mouse
➢ Monitor - SVGA

SOFTWARE REQUIREMENTS:
• Operating system : Windows 7 Ultimate.

• Coding Language : Python.

• Front-End : Python.

• Back-End : Django-ORM

• Designing : Html, css, javascript.

• Data Base : MySQL (WAMP Server).

A Multi Stream Feature Fusion Approach for Traffic Prediction
No ratings yet
A Multi Stream Feature Fusion Approach for Traffic Prediction
5 pages
Roadsideunit
No ratings yet
Roadsideunit
37 pages
2020-If2.6-SAPTM- Towards High-throughput Per-flow Traffic Measurement With a Systolic Array-like Architecture on FPGA
No ratings yet
2020-If2.6-SAPTM- Towards High-throughput Per-flow Traffic Measurement With a Systolic Array-like Architecture on FPGA
15 pages
research paper1 -AI in education
No ratings yet
research paper1 -AI in education
12 pages
Sensors 23 02208 v2
No ratings yet
Sensors 23 02208 v2
26 pages
Building An Intrusion Detection System Using A Filter
No ratings yet
Building An Intrusion Detection System Using A Filter
3 pages
LPDB Lightweight Policy Driven Blockchain With Batch Verification for Rail Transit Systems
No ratings yet
LPDB Lightweight Policy Driven Blockchain With Batch Verification for Rail Transit Systems
8 pages
Smart Traffic Forecasting: Leveraging Adaptive Machine Learning and Big Data Analytics For Traffic Flow Prediction
No ratings yet
Smart Traffic Forecasting: Leveraging Adaptive Machine Learning and Big Data Analytics For Traffic Flow Prediction
10 pages
Efficient Cache-Supported Path Planning On Roads
No ratings yet
Efficient Cache-Supported Path Planning On Roads
24 pages
The Three Tier Abstract
No ratings yet
The Three Tier Abstract
5 pages
Shiva_paper
No ratings yet
Shiva_paper
5 pages
جديد
No ratings yet
جديد
54 pages
Energy Efficient Data Transmission Using Approximate Dynamic Programming in Mobile Cloud Computing
No ratings yet
Energy Efficient Data Transmission Using Approximate Dynamic Programming in Mobile Cloud Computing
14 pages
10.1007@s11277 020 07578 7
No ratings yet
10.1007@s11277 020 07578 7
13 pages
Research Paper
No ratings yet
Research Paper
21 pages
Ieee 2010 Titles: Data Alcott Systems (0) 9600095047
No ratings yet
Ieee 2010 Titles: Data Alcott Systems (0) 9600095047
6 pages
Raspberry Pi As A Low-Cost Data Acquisition System For Human Powered Vehicles
No ratings yet
Raspberry Pi As A Low-Cost Data Acquisition System For Human Powered Vehicles
12 pages
Website Traffic Forecasting
No ratings yet
Website Traffic Forecasting
32 pages
Blockchain Based Data Integrity Verification for Large-Scale IoT Data
No ratings yet
Blockchain Based Data Integrity Verification for Large-Scale IoT Data
9 pages
10.Data-Driven Design of Fog Computing Aided
No ratings yet
10.Data-Driven Design of Fog Computing Aided
5 pages
Informational Braess Paradox - The Effect of Infor (1)
No ratings yet
Informational Braess Paradox - The Effect of Infor (1)
26 pages
Next Wave Mobility
No ratings yet
Next Wave Mobility
13 pages
UNet
No ratings yet
UNet
18 pages
of Data Replication
No ratings yet
of Data Replication
24 pages
An Efficient Privacy-Preserving Credit Score System Based On Non Interactive Zero-Knowledge Proof
0% (1)
An Efficient Privacy-Preserving Credit Score System Based On Non Interactive Zero-Knowledge Proof
5 pages
Bus Tracking System
No ratings yet
Bus Tracking System
10 pages
SDN Apis For Multi-Sided Resource Management in Any Networks
No ratings yet
SDN Apis For Multi-Sided Resource Management in Any Networks
15 pages
Comparative Analysis of AGBFM and IWOFM With Forecasting Models LSSVM-PSO, LSSVM-ACO and LSSVM-WOA
No ratings yet
Comparative Analysis of AGBFM and IWOFM With Forecasting Models LSSVM-PSO, LSSVM-ACO and LSSVM-WOA
17 pages
Monitoring The Application-Layer Ddos Attacks For Popular Websites
No ratings yet
Monitoring The Application-Layer Ddos Attacks For Popular Websites
3 pages
A Multi-Stream Feature Fusion Approach For Traffic Prediction.
No ratings yet
A Multi-Stream Feature Fusion Approach For Traffic Prediction.
5 pages
Netdiffusion: Network Data Augmentation Through Protocol-Constrained Traffic Generation
No ratings yet
Netdiffusion: Network Data Augmentation Through Protocol-Constrained Traffic Generation
32 pages
Dotnet Projects: 1. A Coupled Statistical Model For Face Shape Recovery From Brightness Images
No ratings yet
Dotnet Projects: 1. A Coupled Statistical Model For Face Shape Recovery From Brightness Images
12 pages
Mobile: 9243101428, 7019755620: Highblix - Final Year Projects - Bangalore
No ratings yet
Mobile: 9243101428, 7019755620: Highblix - Final Year Projects - Bangalore
29 pages
2008 11 CYMExpress English
No ratings yet
2008 11 CYMExpress English
2 pages
Traffic Aware Abstract
No ratings yet
Traffic Aware Abstract
4 pages
Helmet Detection Using Machine Learning and Automatic License Final
75% (4)
Helmet Detection Using Machine Learning and Automatic License Final
47 pages
Learning Customer Behaviors For Effective Load Forecasting
No ratings yet
Learning Customer Behaviors For Effective Load Forecasting
7 pages
Blockchain-Based Framework for Traffic Event Verification in Smart Vehicles
No ratings yet
Blockchain-Based Framework for Traffic Event Verification in Smart Vehicles
8 pages
An Effective and Secure Routing System Using Intelligent Water Drop Approach
No ratings yet
An Effective and Secure Routing System Using Intelligent Water Drop Approach
9 pages
Bus Pass System Revolutionizing Ticket Booking.pptx 2
No ratings yet
Bus Pass System Revolutionizing Ticket Booking.pptx 2
16 pages
Evaluacion Instrumentacion
No ratings yet
Evaluacion Instrumentacion
5 pages
Thesis On Vehicular Ad Hoc Network
100% (2)
Thesis On Vehicular Ad Hoc Network
8 pages
Paper 3
No ratings yet
Paper 3
6 pages
Data Processing For Large Database Using Mapreduce Approach Using Apso
No ratings yet
Data Processing For Large Database Using Mapreduce Approach Using Apso
59 pages
41 Submission
No ratings yet
41 Submission
14 pages
Complexity Problems Handled by Big Data Prepared by Shreeya Sharma
No ratings yet
Complexity Problems Handled by Big Data Prepared by Shreeya Sharma
9 pages
M Tech Seminar Topic
No ratings yet
M Tech Seminar Topic
11 pages
Open Source GMNS
No ratings yet
Open Source GMNS
124 pages
Java Projects On 2013 Ieee Papers
No ratings yet
Java Projects On 2013 Ieee Papers
7 pages
Major Project Final
No ratings yet
Major Project Final
21 pages
Flow-Based Network Traffic Generation Using Generative Adversarial Networks
No ratings yet
Flow-Based Network Traffic Generation Using Generative Adversarial Networks
37 pages
Credit Card Fraud Detection
No ratings yet
Credit Card Fraud Detection
3 pages
Java IEEE Abstracts
No ratings yet
Java IEEE Abstracts
24 pages
unit-2
No ratings yet
unit-2
16 pages
VIKASYS Java Proj With Abstract 2011
No ratings yet
VIKASYS Java Proj With Abstract 2011
12 pages
Unit 2
No ratings yet
Unit 2
19 pages
Zhao Sood Cali b
No ratings yet
Zhao Sood Cali b
33 pages
Traffic Flow Prediction For Intelligent Transporta
No ratings yet
Traffic Flow Prediction For Intelligent Transporta
8 pages
Fault IC Detection PPT[1]
No ratings yet
Fault IC Detection PPT[1]
30 pages
Lexicon of Computer Science Terminology: Lexicon of Tech and Business, #16
From Everand
Lexicon of Computer Science Terminology: Lexicon of Tech and Business, #16
Mustafa Al-Dori
4/5 (1)
2020 Icde Paper
No ratings yet
2020 Icde Paper
13 pages
Traffic Sign Board Recognition and Voice Alert System Using Convolutional Neural Network
No ratings yet
Traffic Sign Board Recognition and Voice Alert System Using Convolutional Neural Network
2 pages
Authentication and Key Agreement Based On Anonymous Identity For Peer-To-Peer Cloud
0% (1)
Authentication and Key Agreement Based On Anonymous Identity For Peer-To-Peer Cloud
7 pages
Detection of Cyberbullying On Social Media Using Machine Learning
No ratings yet
Detection of Cyberbullying On Social Media Using Machine Learning
5 pages
Ethics in Information and Technology Professionals
No ratings yet
Ethics in Information and Technology Professionals
36 pages
LM3122ACY-1: LCD Module User Manual
No ratings yet
LM3122ACY-1: LCD Module User Manual
9 pages
Blockchain For 5G Healthcare Applications Security and Privacy Solutions
No ratings yet
Blockchain For 5G Healthcare Applications Security and Privacy Solutions
582 pages
Dissertation Computer Science Example
100% (2)
Dissertation Computer Science Example
4 pages
Choosing A Digital Repository
No ratings yet
Choosing A Digital Repository
30 pages
ReleaseNotes WiFi 23.90
No ratings yet
ReleaseNotes WiFi 23.90
3 pages
Práctica de Laboratorio 21.4.7
No ratings yet
Práctica de Laboratorio 21.4.7
9 pages
Software Reliability
100% (1)
Software Reliability
49 pages
Agilent Technologies 1141A Differential Probe and 1142A Probe Control and Power
No ratings yet
Agilent Technologies 1141A Differential Probe and 1142A Probe Control and Power
90 pages
Learning Guide #38: Information Technology Support Service
No ratings yet
Learning Guide #38: Information Technology Support Service
23 pages
Band-in-a-Box 2023 Manual
No ratings yet
Band-in-a-Box 2023 Manual
434 pages
Powerpoint Homework Tasks
100% (1)
Powerpoint Homework Tasks
6 pages
Microsoft AZ 102
No ratings yet
Microsoft AZ 102
359 pages
(OOP2024) Lecture 1 - Programming Languages History and Paradigms
No ratings yet
(OOP2024) Lecture 1 - Programming Languages History and Paradigms
27 pages
Istar Ultra Hardware Install Guide - Ra0 - LT - en
No ratings yet
Istar Ultra Hardware Install Guide - Ra0 - LT - en
32 pages
Lan, Man, Wan
No ratings yet
Lan, Man, Wan
4 pages
Santosh (3 0)
No ratings yet
Santosh (3 0)
4 pages
Example File - Sample (Dummy) Files
No ratings yet
Example File - Sample (Dummy) Files
7 pages
Universal Tally Note 2074&m18
100% (6)
Universal Tally Note 2074&m18
56 pages
BIA 5000 Introduction To Analytics - Lesson 3
No ratings yet
BIA 5000 Introduction To Analytics - Lesson 3
35 pages
Benefits of Artificial Intelligence and Machine Learning in Marketing
No ratings yet
Benefits of Artificial Intelligence and Machine Learning in Marketing
6 pages
Letter G Card
No ratings yet
Letter G Card
2 pages
The Ultimate Anki Guide - Med Student Edition
No ratings yet
The Ultimate Anki Guide - Med Student Edition
18 pages
Simulation Details - Phish Insight
No ratings yet
Simulation Details - Phish Insight
1 page
PeopleLink Quadro P - Audio Conference Speakerphone
No ratings yet
PeopleLink Quadro P - Audio Conference Speakerphone
4 pages
Aud3 Chap3 Sample Quiz
No ratings yet
Aud3 Chap3 Sample Quiz
25 pages
Frequently Asked Questions
No ratings yet
Frequently Asked Questions
5 pages
Electronic Weighing Indicator: Features
No ratings yet
Electronic Weighing Indicator: Features
2 pages
Prototyping Methods
No ratings yet
Prototyping Methods
9 pages
KEY - TestNav Practice (Algebra I SOL 2016) With Detail - 23-24
No ratings yet
KEY - TestNav Practice (Algebra I SOL 2016) With Detail - 23-24
13 pages