Shinkansen Travel Experience - Hackathon
[Image: Shinkansen Bullet Train - Japan]
I am glad to share that I recently participated in a hackathon organized by Great
Learning in collaboration with the McCombs School of Business and the Great Lakes
Institute of Management, as part of my PGP - DSBA course.
The goal of the problem is to predict whether a passenger was delighted with his or
her overall experience of traveling on the Shinkansen (bullet train).
We are given four datasets: two training sets and two test sets. One train/test
pair contains travel data, and the other pair contains survey data.
I performed exploratory data analysis (EDA) to understand the data. It is a binary
classification problem on the satisfaction of customers who traveled on the bullet
train. The data was collected on various parameters, but the ultimate goal is to
predict overall customer satisfaction. I used various classification models for
prediction, such as:
1) A Classification and Regression Tree (CART) is a predictive model that explains
how an outcome variable's values can be predicted from other variables. A CART
output is a decision tree in which each fork is a split on a predictor variable
and each leaf node contains a prediction for the outcome variable.
2) Random Forest is a supervised learning algorithm that uses an ensemble of
decision trees. Ensemble learning combines predictions from multiple machine
learning models to make a more accurate prediction than any single model; here it
is used for classification.
3) Boosting, in machine learning, is an ensemble meta-algorithm for reducing bias
and variance in supervised learning, and a family of machine learning algorithms
that convert weak learners into strong ones.
4) Bagging, also known as bootstrap aggregation, is an ensemble learning method
that is commonly used to reduce variance within a noisy dataset. In bagging, a
random sample of data in a training set is selected with replacement—meaning that
the individual data points can be chosen more than once.
5) Naïve Bayes is one of the simplest and most effective classification
algorithms, helping build fast machine learning models that can make quick
predictions. It is a probabilistic classifier: it predicts the most probable class
for an object based on feature probabilities.
6) Logistic regression is a supervised learning classification algorithm used to
predict the probability of a target variable. The target or dependent variable is
dichotomous, meaning there are only two possible classes. In simple words, the
dependent variable is binary, with data coded as either 1 (success/yes) or 0
(failure/no). Mathematically, a logistic regression model predicts P(Y=1) as a
function of X.
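To make the comparison above concrete, here is a minimal sketch that fits each of these model families with scikit-learn. The dataset, parameter values, and accuracy numbers are illustrative (generated synthetically), not the actual hackathon data or settings:

```python
# Illustrative comparison of the six model families on synthetic binary data.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import (
    RandomForestClassifier, AdaBoostClassifier, BaggingClassifier,
)
from sklearn.naive_bayes import GaussianNB
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# Synthetic stand-in for the hackathon's travel + survey data.
X, y = make_classification(n_samples=2000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

models = {
    "CART": DecisionTreeClassifier(random_state=42),
    "Random Forest": RandomForestClassifier(random_state=42),
    "Boosting (AdaBoost)": AdaBoostClassifier(random_state=42),
    "Bagging": BaggingClassifier(random_state=42),
    "Naive Bayes": GaussianNB(),
    "Logistic Regression": LogisticRegression(max_iter=1000),
}

# Fit each model and report test-set accuracy.
for name, model in models.items():
    model.fit(X_train, y_train)
    acc = accuracy_score(y_test, model.predict(X_test))
    print(f"{name}: {acc:.4f}")
```

In practice I compared the models on the hold-out data like this before settling on the final approach.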
Adaptive Boosting (AdaBoost) with a Random Forest base estimator worked well for
me: I achieved 95.39% accuracy in my prediction. For a while (14 hours) I was at
the top of the leaderboard. I participated to win and to learn as much as
possible; I learned a lot and finished in the top 5. Looking forward to more such
participations. It was a wonderful learning experience, and I would like to apply
these useful Data Science techniques at my workplace too.
Thank You #greatlearning for this experience.
#machinelearning #datascience #greatlearning #hackathon #hackofalltrades
Article Link
https://www.linkedin.com/pulse/shinkansen-travel-experience-hackathon-nishant-rai-sethia