Airline Tweets Classification Using Naive Bayes Classifier

This document summarizes a study that used a Naive Bayes classifier to analyze sentiment in airline tweets. The study took tweets about airline services from a Kaggle dataset and classified them as positive, negative, or neutral. Most tweets in the dataset were negative (62.6%). The classifier computes a prior probability for each sentiment class and per-token probabilities from the training tweets, then assigns each tweet the class with the highest resulting probability. A benchmark program written with the NLTK library reached 82.0% accuracy and took 8 minutes; the proposed program had 2.5% less accuracy but ran 320X faster. A better tokenizer and a balanced dataset could increase the accuracy further.

Airline Tweets Classification Using Naive Bayes Classifier

Kaninson Joshua R, Sri Haran K, Sudalai Muthu Selva Kumar S.S, Dr. D. Jemi Florinabel *
Dr. Sivanthi Aditanar College of Engineering, Tiruchendur
I) Abstract:
Every second, our modern world produces terabytes of data. In this project, we have developed a program that classifies reviews of a particular service based on its customers' tweets. The project helps service providers analyze the overall opinion about that service, and they can use the results to improve the quality of their services. Extending the project to a wide range of products and services would result in a better understanding between service providers and customers. On Twitter, customers of airline services can tweet their opinions about their travel experiences. Twitter therefore contains a massive amount of data and information regarding airline services. These tweets are collected and their sentiments explored to track customer satisfaction. This project aims to analyze the Twitter airline dataset to find the overall positive, negative, and neutral tweets using the Naive Bayes algorithm.

II) Introduction
The simple solution would be to fix a set of words that humans consider positive and negative and let the program count the number of positive and negative words in each tweet. The drawback is that such a word set is limited and may not work in the particular domain, and it may omit keywords that carry no obvious emotion. Also, the accuracy of this approach would be very low. The Naive Bayes classifier, however, can be trained on the domain; hence, it performs better than the word-counting approach. Naive Bayes gives a weight to each word; hence it can classify with better accuracy.

III) Methodology
1. Dataset Collection
We use the Kaggle dataset for this project. We used the columns text and airline_sentiment to classify the tweets.
Percentage of each class of tweets:
1. positive - 19.2%
2. neutral - 21.2%
3. negative - 62.6%

Fig 1.0: Dataset Analysis

2. Bayes Algorithm:
The formula below states the Bayes algorithm:
p(A|B) = p(B|A) p(A) / p(B)
1. p(A), p(B) are the probabilities of observing A and B respectively.
2. p(A|B) is the probability of event A occurring given that B is true.
3. p(B|A) is the probability of event B occurring given that A is true.

3. Naive Bayes Classifier Algorithm:
The Naïve Bayes classifier is one of the simplest and most effective classification algorithms; it is a fast machine learning model that can make quick predictions. It is called "naïve" because it assumes that the occurrence of a particular feature is independent of the occurrence of other features, and "Bayes" because it depends on the Bayes algorithm.
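As a minimal sketch of how the independence assumption is used for tweets: the score of a class is its prior probability multiplied by one probability per token, each estimated from counts in the training tweets. The function names, the whitespace tokenizer, the add-one smoothing, and the use of log-probabilities are all assumptions made for this illustration; the paper does not show its code, and the toy tweets are made up.

```python
import math
from collections import Counter, defaultdict


def tokenize(tweet):
    # Assumption: lowercase whitespace splitting; the paper's tokenizer is not shown.
    return tweet.lower().split()


def train(tweets, labels):
    """Count tweets per class and token occurrences per class."""
    class_counts = Counter(labels)
    token_counts = defaultdict(Counter)
    for tweet, label in zip(tweets, labels):
        token_counts[label].update(tokenize(tweet))
    vocabulary = {tok for counts in token_counts.values() for tok in counts}
    return class_counts, token_counts, vocabulary


def predict(tweet, class_counts, token_counts, vocabulary):
    """Return the class with the highest prior times product of token probabilities."""
    total_tweets = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_tweets in class_counts.items():
        # Log of the prior p(class); logs turn the product into a sum.
        score = math.log(n_tweets / total_tweets)
        n_tokens = sum(token_counts[label].values())
        for tok in tokenize(tweet):
            # Add-one smoothing (an assumption) keeps unseen tokens from zeroing the score.
            p = (token_counts[label][tok] + 1) / (n_tokens + len(vocabulary))
            score += math.log(p)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy, made-up training data for illustration only.
tweets = ["great flight thanks", "flight delayed again",
          "terrible service and delayed", "what time is boarding"]
labels = ["positive", "negative", "negative", "neutral"]
model = train(tweets, labels)
print(predict("delayed and terrible", *model))
```

Working in log space does not change which class wins; it only avoids very small products when a tweet has many tokens.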
4. Working:
The Naive Bayes classifier is used here for text classification. First, the algorithm finds the total number of positive, negative, and neutral tweets. Then it finds the prior probability of each class by dividing the number of tweets in that class by the total number of tweets. Then it separates every tweet into tokens and finds the number of occurrences of these tokens in positive, negative, and neutral tweets.
Testing involves the following steps. First, split the tweet into tokens. Then find the probability of being positive for each token and multiply it by the positive prior probability. Then find the probability of being negative for each token and multiply it by the negative prior probability. Then find the probability of being neutral for each token and multiply it by the neutral prior probability. Finally, compare the probability of each class of sentiment and return the sentiment with the highest probability.

VIII) Performance Measures:
1. Sample Input/Output:

Fig 1.1: Sample Input/Output

For benchmarking, the program was compared with another program written using the nltk library. The output of the program with the library function is as follows:

Fig 1.2: Result Analysis

1. The program with the library function had an accuracy of 82.0% and took 8 minutes.
2. The proposed program has 2.5% less accuracy and is 320X faster than the benchmarking program.

IX) Conclusion:
The program can be implemented with a better tokenizing algorithm to increase its accuracy slightly. If we train the classifier with a balanced dataset, or use a data balancing algorithm to balance the dataset, the accuracy will increase.
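The Conclusion leaves the choice of data balancing algorithm open. One simple option, shown here purely as an assumption and not as the authors' method, is random oversampling: tweets from the smaller classes are duplicated at random until every class is as large as the largest one. The function name `oversample` and the toy tweets are hypothetical.

```python
import random
from collections import Counter, defaultdict


def oversample(tweets, labels, seed=0):
    """Randomly duplicate minority-class tweets until all classes are the same size."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for tweet, label in zip(tweets, labels):
        by_class[label].append(tweet)
    # Every class is grown to the size of the largest class.
    target = max(len(items) for items in by_class.values())
    balanced = []
    for label, items in by_class.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        balanced.extend((tweet, label) for tweet in items + extra)
    rng.shuffle(balanced)
    new_tweets = [tweet for tweet, _ in balanced]
    new_labels = [label for _, label in balanced]
    return new_tweets, new_labels


# Toy, made-up data with the same kind of skew toward negative tweets.
tweets = ["late again", "lost my bag", "rude staff", "thanks crew", "gate info please"]
labels = ["negative", "negative", "negative", "positive", "neutral"]
bal_tweets, bal_labels = oversample(tweets, labels)
print(Counter(bal_labels))
```

Balancing of this kind is normally applied to the training split only, so that the test tweets keep the original class proportions.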
