CE802 Pilot

This proposal aims to use machine learning to predict whether a new hotel opened by a large chain would be profitable based on its location and other socioeconomic factors. Classification algorithms like decision trees, support vector machines, and naive bayes will be tested on historical hotel data. Decision trees achieved the highest accuracy of 81.2% compared to 64.4% for SVM and 65.6% for naive bayes. The proposal is to use a decision tree to make predictions on test data and output the results.

Uploaded by

prenithjohnsamuel


Assignment CE802 Machine Learning:

Design and Application of a Machine Learning System for a Practical Problem

Pilot Study proposal:

This study aims to identify and implement a machine learning technique suited to the following
problem: predicting whether a new hotel opened by a large hotel chain in a given location will be
profitable.

Location is one of the most important factors affecting a hotel's business, and it strongly influences
whether the hotel turns a profit. The location, together with geographical and socio-economic data
such as health, crime, population and access to public transport, provides the main inputs to the
machine learning process. We are provided with historical data on successful and unsuccessful hotels
opened under the chain's brand in comparable locations, along with geographical and socio-economic
data about those locations and neighborhoods.

To make the prediction, we must first choose a predictive task to perform on the given data. Several
kinds of task are available when designing a machine learning system. Some of them are:
• Classification
• Regression
• Clustering
• Rules mining etc.

We will treat this as a classification problem, because classification predicts which class a data
point belongs to. It is the appropriate choice when the model must assign each data point in a dataset
to one of a fixed set of classes. In our case, we want to determine, from the given data, whether a
hotel falls into the class profit (true) or the class loss (false).

After selecting the type of task, we proceed to preprocessing, which helps to improve the accuracy of
the model. There are various preprocessing methods, and we must select the ones our dataset needs. In
our case, column F21 has over five hundred missing values. Missing values can be filled with the
mean, median or mode of the column. We use the mean of the column to fill them.
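
As a minimal sketch of mean imputation, the snippet below fills missing entries in one column with the mean of the observed entries. The toy rows, the `Class` key and the helper name `impute_mean` are assumptions made for illustration; only the column name F21 comes from the proposal.

```python
from statistics import mean

def impute_mean(rows, column):
    """Return a copy of rows with missing values in `column` replaced by the column mean."""
    observed = [row[column] for row in rows if row[column] is not None]
    fill_value = mean(observed)  # mean of the values that are actually present
    return [
        {**row, column: fill_value if row[column] is None else row[column]}
        for row in rows
    ]

# Hypothetical toy rows; the real P2_Data file has many more columns and rows.
rows = [
    {"F21": 2.0, "Class": True},
    {"F21": None, "Class": False},
    {"F21": 4.0, "Class": True},
    {"F21": None, "Class": False},
]
filled = impute_mean(rows, "F21")
print([row["F21"] for row in filled])  # [2.0, 3.0, 4.0, 3.0]
```

When the data is later split into train and test sets, it is usually safer to compute the fill value from the train rows only, so that no information from the test rows leaks into training.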

The next step is to pick a suitable classifier for our data. Various types of classifier are
available. Some of them are:
• Decision Tree classifiers
• K-Nearest neighbor
• Random Forest
• Support vector machine
• Naïve Bayes etc.
We will implement some of these classifiers to see which one predicts most accurately on our dataset.
We use a Decision Tree model, a Support Vector Machine model and a Naïve Bayes model, and compare
their accuracy.

First, we use the Decision Tree model to predict the outcomes. A decision tree gets its name from the
tree-like model it uses to represent decisions and their possible consequences. We split the P2_Data
csv into train and test data, train the model on the train data and use the test data to measure how
well it predicts. The Decision Tree algorithm achieves a prediction accuracy of 81.2%.
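
The split-train-evaluate procedure can be sketched with a decision stump, which is a decision tree with a single split. This is an illustration only, not the implementation used in the study: the synthetic data, the 25% test fraction, the Gini impurity criterion and all function names are assumptions.

```python
import random

def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of rows and split it into (train, test)."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

def gini(labels):
    """Gini impurity of a list of boolean labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def fit_stump(rows):
    """Find the (feature, threshold) split with the lowest weighted Gini impurity."""
    best = None
    for j in range(len(rows[0][0])):
        for threshold in sorted({x[j] for x, _ in rows}):
            left = [y for x, y in rows if x[j] <= threshold]
            right = [y for x, y in rows if x[j] > threshold]
            if not left or not right:
                continue  # a split must put rows on both sides
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if best is None or score < best[0]:
                # Each side predicts its majority class.
                best = (score, j, threshold,
                        sum(left) * 2 >= len(left), sum(right) * 2 >= len(right))
    return best[1:]

def predict_stump(stump, x):
    j, threshold, left_label, right_label = stump
    return left_label if x[j] <= threshold else right_label

def accuracy(stump, rows):
    return sum(predict_stump(stump, x) == y for x, y in rows) / len(rows)

# Synthetic data (assumption): feature 0 decides the label, feature 1 does not separate it.
rows = [((i % 10, (i * 7) % 10), i % 10 > 4) for i in range(40)]
train, test = train_test_split(rows)
stump = fit_stump(train)
print("split on feature", stump[0], "at threshold", stump[1])
print("train accuracy:", accuracy(stump, train))
print("test accuracy:", accuracy(stump, test))
```

A full decision tree applies the same split search recursively to each side until a stopping rule is met. The accuracy that matters for comparing models is the one measured on the held-out test rows.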

Next, we use the Support Vector Machine (SVM) model to predict the outcomes. An SVM classifies by
finding a decision boundary that separates the classes with the largest possible margin; in our case
the two classes are true and false. Again, we split the P2_Data csv into train and test data, train
the model on the train data and evaluate it on the test data. The SVM algorithm achieves a prediction
accuracy of 64.4%.
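
To illustrate the idea behind an SVM, here is a minimal sketch of a linear SVM trained by full-batch subgradient descent on the regularised hinge loss. The proposal does not say which kernel or library was used, so the linear form, the hyperparameters (`lam`, `lr`, `epochs`), the toy points and the mapping of profit to +1 and loss to -1 are all assumptions.

```python
def train_linear_svm(points, labels, lam=0.01, lr=0.1, epochs=200):
    """Minimise (lam / 2) * ||w||^2 + mean hinge loss by subgradient descent.

    labels must be +1 or -1. Returns (weights, bias).
    """
    n, dim = len(points), len(points[0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regularisation term
        grad_b = 0.0
        for x, y in zip(points, labels):
            margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
            if margin < 1:  # point is inside the margin or misclassified
                for j in range(dim):
                    grad_w[j] -= y * x[j] / n
                grad_b -= y / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def predict(w, b, x):
    """Return +1 or -1 depending on which side of the boundary x lies."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

# Toy, linearly separable points (assumption): +1 stands for profit, -1 for loss.
points = [(2, 2), (3, 3), (2, 3), (-2, -2), (-3, -3), (-2, -3)]
labels = [1, 1, 1, -1, -1, -1]
w, b = train_linear_svm(points, labels)
print([predict(w, b, x) for x in points])  # [1, 1, 1, -1, -1, -1]
```

Only points with a margin below 1 contribute to the update, which is why the final boundary depends mainly on the points closest to it.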

Finally, we use the Naïve Bayes model to predict the outcomes. It applies Bayes' theorem under the
"naive" assumption that the features are independent of one another given the class. Again, we split
the P2_Data csv into train and test data, train the model on the train data and evaluate it on the
test data. The Naïve Bayes algorithm achieves a prediction accuracy of 65.6%.
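
The following sketch shows one common variant, Gaussian Naïve Bayes, which models each numeric feature within each class as a normal distribution. The proposal does not state which variant was used, so the Gaussian assumption, the variance floor, the toy data and the function names are all illustrative.

```python
import math
from collections import defaultdict

def fit_gaussian_nb(points, labels, var_floor=1e-9):
    """Estimate the class priors and the per-feature mean and variance for each class."""
    by_class = defaultdict(list)
    for x, y in zip(points, labels):
        by_class[y].append(x)
    model = {}
    for y, xs in by_class.items():
        n = len(xs)
        means = [sum(col) / n for col in zip(*xs)]
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor  # floor avoids division by zero
            for col, m in zip(zip(*xs), means)
        ]
        model[y] = (n / len(points), means, variances)
    return model

def predict_gaussian_nb(model, x):
    """Return the class with the highest log posterior (up to a shared constant)."""
    def log_posterior(prior, means, variances):
        # Independence assumption: per-feature log likelihoods are simply added.
        log_likelihood = sum(
            -0.5 * math.log(2 * math.pi * var) - (v - m) ** 2 / (2 * var)
            for v, m, var in zip(x, means, variances)
        )
        return math.log(prior) + log_likelihood
    return max(model, key=lambda y: log_posterior(*model[y]))

# Toy data (assumption): two numeric features per hotel, True means profit.
points = [(8.0, 1.0), (9.0, 2.0), (7.0, 1.5), (2.0, 6.0), (3.0, 7.0), (1.0, 5.5)]
labels = [True, True, True, False, False, False]
model = fit_gaussian_nb(points, labels)
print(predict_gaussian_nb(model, (8.5, 1.2)))  # True
print(predict_gaussian_nb(model, (2.5, 6.5)))  # False
```

Working with log probabilities rather than raw probabilities avoids numerical underflow when many features are multiplied together.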

Of these algorithms, the Decision Tree gives the most accurate results on our data, so we will use
the Decision Tree algorithm for our dataset. We apply the trained decision tree to the P2_test_data
csv file to predict the class column, and write the predictions to a new file named
P2_test_predictions.csv, which is our desired output.
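
The model selection and output steps can be sketched as follows. The accuracies are those reported above; the feature column names `F1` and `F2`, the `Class` header, the placeholder predictions and the temporary output directory are assumptions for illustration.

```python
import csv
import os
import tempfile

# Accuracies reported in this proposal.
reported_accuracy = {"Decision tree": 0.812, "SVM": 0.644, "Naive Bayes": 0.656}
best_model = max(reported_accuracy, key=reported_accuracy.get)
print(best_model)  # Decision tree

def write_predictions(path, header, feature_rows, predictions):
    """Write each feature row followed by its predicted class to a CSV file."""
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(header + ["Class"])
        for row, label in zip(feature_rows, predictions):
            writer.writerow(row + [label])

# Placeholder rows and predictions (assumption); real ones would come from
# applying the trained model to the rows of the P2_test_data csv file.
out_path = os.path.join(tempfile.mkdtemp(), "P2_test_predictions.csv")
write_predictions(out_path, ["F1", "F2"], [[1.0, 2.0], [3.0, 4.0]], [True, False])
```

Selecting the model by held-out accuracy, as done here, is reasonable for a pilot study. With a single train/test split, though, the ranking can change from one random split to another.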
