0% found this document useful (0 votes)

86 views26 pages

Twitter Sentiment Analysis - Final - Report Copy Sahil

This document provides an overview of a project on Twitter sentiment analysis. The purpose is to classify tweets as expressing positive, negative, or neutral sentiment. The objectives are to develop a tool to automatically and accurately classify tweets by sentiment. Motivation for the project includes using Twitter as a better representation of public sentiment compared to other sources, and applying sentiment analysis to domains such as business and politics. The project will develop a functional classifier and evaluate whether additional Twitter user data improves classification accuracy.

Uploaded by

Daksh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

86 views26 pages

Twitter Sentiment Analysis - Final - Report Copy Sahil

Uploaded by

Daksh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 26

Twitter Sentiment Analysis

INDUSTRIAL TRAINING PROJECT REPORT

Submitted in partial fulfilment of the requirements for the Degree

of
BACHELOR OF TECHNOLOGY

In
INFORMATION TECHNOLOGY
By
Sahil Khunger ( 43715603117)

DEPARTMENT OF INFORMATION TECHNOLGY

DR. AKHILESH DASS GUPTA INSTITUTE OF TECH & MGMT
(AFFILIATED TO GURU GOBIND SINGH INDRAPRASTHA UNIVERSITY, DELHI)
NEW DELHI – 110053

i
CERTIFICATE
CANDIDATES DECLARATION

I hereby certify that the work, which is being presented in the project synopsis, entitled
Twitter Sentiment Analysis, in partial fulfilment of the requirement for the award of the
Degree of Bachelor of Technology and submitted to the institution is an authentic record
of my own work carried out during the period 2019 – Present.

Date: Signature of the Candidate

iii
TABLE OF CONTENTS

Title Page No.

ABSTRACT...................................................................................................vi
ACKNOWLEDGEMENT….........................................................................vii
LIST OF FIGURES......................................................................................viii

CHAPTER 1 INTRODUCTION

1.1 Purpose................................................................................................9
1.2 Objective............................................................................................10
1.3 Motivation….....................................................................................11
1.4 Definition Overview…......................................................................12

CHAPTER 2 OVERALL DESCRIPTION

2.1 Project Perspective...........................................................................13

2.2 Project Functions..............................................................................13
2.3 Flow Chart Diagram….....................................................................15
2.4 Constraints and Assumptions….......................................................17

CHAPTER 3 SYSTEM REQUIREMENTS

3.1 External Interface Requirement.......................................................18
3.1.1 Hardware interface..............................................................18
3.1.2 Software Interface...............................................................18
3.2 Functional Requirement...................................................................18
3.3 Non-functional Requirement...........................................................19
3.4 Hardware and Performance Requirement…....................................19

CHAPTER 4 CONCLUSION AND FUTURE WORK

4.1 Results.............................................................................................................20
4.2 Future Work....................................................................................................21
REFERENCES...........................................................................................................22

APPENDIX................................................................................................................23
ABSTRACT

This project addresses the problem of sentiment analysis in twitter; that is classifying
tweets according to the sentiment expressed in them: positive, negative or neutral.
Twitter is an online micro-blogging and social-networking platform which allows users
to write short status updates of maximum length 140 characters. It is a rapidly expanding
service with over 200 million registered users - out of which 100 million are active users
and half of them log on twitter on a daily basis - generating nearly 250 million tweets per
day. Due to this large amount of usage we hope to achieve a reflection of public
sentiment by analyzing the sentiments expressed in the tweets. Sentiment analysis has
many applications in different domains including, but not limited to, business
intelligence, politics, sociology, etc. Recent years, on the other hand, have witnessed the
advent of social networking websites, microblogs, wikis and Web applications and
consequently, an unprecedented growth in user-generated data is poised for sentiment
mining. Data such as web-postings, Tweets, videos, etc., all express opinions on various
topics and events, offer immense opportunities to study and analyze human opinions and
sentiment Analyzing the public sentiment is important for many applications such as
firms trying to find out the response of their products in the market, predicting political
elections and predicting socioeconomic phenomena like stock exchange. The aim of this
project is to develop a functional classifier for accurate and automatic sentiment
classification.

6
ACKNOWLEDGEMENTS

I am deeply thankful to my advisor. Mr. Shailendra Singh for helping me throughout the
course in accomplishing my final project. His guidance, support and motivation enabled
me in achieving the objectives of the project.
LIST OF FIGURES

Figure 1: Proposed System of Sentiment Analysis…........................................................9

Figure 2: Keys and Tokens from Twitter Dev Console...................................................14

Figure 3: Procedural flow chart of Sentiment Analysis...................................................15

Figure 4: Architecture Diagram of Sentiment Analysis...................................................16

CHAPTER I: INTRODUCTION

1.1 Purpose
This project addresses the problem of sentiment analysis in twitter; that is classifying
tweets according to the sentiment expressed in them: positive, negative or neutral.
Twitter is an online micro-blogging and social-networking platform which allows users
to write short status updates of maximum length 140 characters. It is a rapidly expanding
service with over 200 million registered users - out of which 100 million are active users
and half of them log on twitter on a daily basis - generating nearly 250 million tweets per
day. Due to this large amount of usage we hope to achieve a reflection of public
sentiment by analyzing the sentiments expressed in the tweets. Sentiment analysis has
many applications in different domains including, but not limited to, business
intelligence, politics, sociology, etc. Recent years, on the other hand, have witnessed the
advent of social networking websites, microblogs, wikis and Web applications and
consequently, an unprecedented growth in user-generated data is poised for sentiment
mining. Data such as web-postings, Tweets, videos, etc., all express opinions on various
topics and events, offer immense opportunities to study and analyze human opinions and
sentiment Analyzing the public sentiment is important for many applications such as
firms trying to find out the response of their products in the market, predicting political
elections and predicting socioeconomic phenomena like stock exchange. The aim of this
project is to develop a functional classifier for accurate and automatic sentiment
classification.

Figure 1: Proposed System

1.2 Objective

Given a message, classify whether the message is of positive, negative, or neutral

sentiment. For messages conveying both a positive and negative sentiment, whichever is
the stronger sentiment should be chosen.

Questions

Main question: How can Twitter tweets be automatically and accurately classiﬁed with
respect to their sentiment?

In this project the main goal is to accurately classify tweets with respect to their sentiment.
This is realized by developing a tool which can classify the tweets.

Does Twitter’s additional user information improve the classiﬁcation accuracy?

In the data analysis, the performance of a classiﬁcation algorithm is often domain

specific, and therefore depends on the domain where it is applied to. In general, different
classification problems can have different superior algorithms. Also in case of one
domain, a different subset can have a different superior algorithm. Therefore, an
algorithm which is the overall best, does not exist. Restricting the scope possibly leads to
a better accuracy of an algorithm. This could be realized by narrowing the scope down to
a smaller domain, by restricting the content to some topic. In addition, it is interesting to
have more information about the users to possibly segment the users into user groups.
1.3 Motivation

This project has been chosen to work with twitter since it is a better approximation of
public sentiment as opposed to conventional internet articles and web blogs. The reason
is that the amount of relevant data is much larger for twitter, as compared to traditional
blogging sites. Moreover, the response on twitter is more prompt and also more general
(since the number of users who tweet is substantially more than those who write web
blogs on a daily basis).

Sentiment analysis of public is highly critical in macro-scale socioeconomic phenomena

like predicting the stock market rate of a particular firm. This could be done by analyzing
overall public sentiment towards that firm with respect to time and using economics tools
for finding the correlation between public sentiment and the firm’s stock market value.
Firms can also estimate how well their product is responding in the market, which areas
of the market is it having a favorable response and in which a negative response (since
twitter allows us to download stream of geo-tagged tweets for particular locations. If
firms can get this information they can analyze the reasons behind geographically
differentiated response, and so they can market their product in a more optimized manner
by looking for appropriate solutions like creating suitable market segments. Predicting
the results of popular political elections and polls is also an emerging application to
sentiment analysis. Twitter is a popular micro blogging service where users create status
messages (called “tweets”). These tweets sometimes express opinions about different
topics. I propose to build an automatic sentiment (positive or neutral or negative)
extractor from a tweet. This is very useful because it allows feedback to be aggregated
without manual intervention. Using this analyzer,

 Consumers can use sentiment analysis to research products or services before

making a purchase. E.g. Kindle
 Marketers can use this to research public opinion of their company and products,
or to analyze customer satisfaction. E.g. Election Polls
 Organizations can also use this to gather critical feedback about problems in
newly released products. E.g. Brand Management (Nike, Adidas)
1.4 Definition and Overview

Sentiment analysis (also known as opinion mining) refers to the use of text analysis and
to identify and extract subjective information in source materials. Sentiment analysis is
widely applied to reviews and social media for a variety of applications, ranging from
marketing to customer service.
Sentiment analysis is the multidisciplinary field of study that deals with analyzing
people’s sentiments, attitudes, emotions and opinions about different entities such as
products, services, individuals, companies, organizations, events and topics and includes
multiple fields such as information retrieval, machine learning and artificial intelligence.
It is set of computational and NLP based techniques which could be leveraged in order to
extract subjective information in a given text unlike factual information, opinions and
sentiments are subjective.
Generally speaking, sentiment analysis aims to determine the attitude of a speaker or a
writer with respect to some topic or the overall contextual polarity of a document. The
attitude may be his or her judgment or evaluation affective state (that is to say, the
emotional state of the author when writing), or the intended emotional communication
(that is to say, the emotional effect the author wishes to have on the reader.

What is sentiment analysis?

Sentiment Analysis is the process of ‘computationally’ determining whether a piece of

writing is positive, negative or neutral. It’s also known as opinion mining, deriving the
opinion or attitude of a speaker.

Why sentiment analysis?

▪ Business: In marketing field companies use it to develop their strategies, to understand

customers’ feelings towards products or brand, how people respond to their
campaigns or product launches and why consumers don’t buy some products.

▪ Politics: In political field, it is used to keep track of political view, to detect

consistency and inconsistency between statements and actions at the government
level. It can be used to predict election results as well!

▪ Public Actions: Sentiment analysis also is used to monitor and analyze social
phenomena, for the spotting of potentially dangerous situations and determining
the general mood of the public.
CHAPTER 2: OVERALL DESCRIPTION

2.1 Project Perspective

The main perspective of Sentiment Analysis refers to the use of text analysis to identify
and extract subjective information in textual contents. There are two type of user-
generated content available on the web – facts and opinions. Facts are statements about
topics and in the current scenario, easily collectible from the Internet using search
engines that index documents based on topic keywords. Opinions are user specific
statement exhibiting positive or negative sentiments about a certain topic. Generally,
opinions are hard to categorize using keywords. Various text analysis and machine
learning techniques are used to mine opinions from a document. Sentiment Analysis
finds its application in a variety of domains.

2.2 Project Functions

The main function of this project is to collect tweets or feeds from twitter as data in order
to determine Positive and Negative percentage and viewed a visualization based on
twitter data.

Installation Process:

 Tweepy: tweepy is the python client for the official Twitter API.

Install it using following pip command:

pip install tweepy

 TextBlob: textblob is the python library for processing textual data.

Install it using following pip command:

pip install textblob

Authentication:
In order to fetch tweets through Twitter API, one needs to register an App through their
twitter account. Follow these steps for the same:
 Open this link and click the button: ‘Create New App’
 Fill the application details. You can leave the callback URL field empty.
 Once the app is created, you will be redirected to the app page.
 Open the ‘Keys and Access Tokens’ tab.
 Copy ‘Consumer Key’, ‘Consumer Secret’, ‘Access token’ and ‘Access
Token Secret’.

Keys and tokens

Consumer API keys: -

9ODKKNtTFpvf3Sdf7JxuWMxiJ (API key)

nhIjH1RpNRRnIvLrp5BPmt9Sw26DbeYcg0h6vRnfxkLTsv571V (API secret key)

Access token & access token secret: -

794095176427913216-mMiuQhWxaMVysL79czYW0tchubNCeYp (Access token)

wzt6479cP7gSoGOCjpLriDZyv6k57u91S46VFLfFi0LAL (Access token secret)

2.3 : Procedural Flow Chart

Figure 2: Procedural flow chart of Sentiment Analysis

Figure 3: Architecture of Sentiment Analysis
2.4 Constraints and Assumptions

The key challenges for sentiment analysis are: -

 Named Entity Recognition - What is the person actually talking about, e.g. is
300 Spartans a group of Greeks or a movie?

 Parsing - What is the subject and object of the sentence, which one does the verb
and/or adjective actually refer to?

 Sarcasm - If you don't know the author you have no idea whether 'bad' means
bad or good.

 Twitter - abbreviations, lack of capitals, poor spelling, poor punctuation, poor

grammar.
 Argumentation - one of the most growing and challenging directions of future
sentiment analysis techniques on social media is argumentation. While sentiment
analysis is about understanding users' opinions on some aspects, argumentation
aims at identifying the reasons of such opinions and the overall reasoning path in
general.

The following Assumptions are:

 It assumes that data collected are from real account that is all accounts
are real on data and no account is parody.

 It assumes that the collected data is overall overview of people on Twitter.

 It considers all emoticons as special characters having no sentiment.

 It does not consider grammar or structure of tweet as it tokenizes

every word.
CHAPTER 3: SYSTEM REQUIRMENTS

3.1 External Interface Requirement

3.1.1 Hardware Interface
The application is intended to be a stand-alone, single-user system. The
application is running on Windows and MAC. No further hardware devices or interfaces
will be required.

3.1.2 Software Interface

 Inputs: The software will receive input from two sources. First, the user
interface and second, the Twitter API. The user interface will supply the
keywords and the analysis session duration, while the Twitter API will
supply the Tweet text.
 Outputs: The output is showing the current mood of the Twitter
community on a given topic in the form of a simple gauge.
 Operating System: The software will run on the Microsoft Windows 8.1
and Mac OS 10 operating system.

3.2 Functional Requirement

 Retrieving Input:
The software will receive three inputs: keywords and Tweets.

o Keywords will be entered by the user for each topic.

o Tweets will be retrieved with the Twitter Streaming API.

 Real-Time Processing:
The software will take input, process data, and display output in real-time.
This will enforce that the snapshot provided by the simple gauge is a
current view of the Twitter community’s mood on the chosen topic.

 Sentiment Analysis:
Sentiment analysis will be performed on the user-specified keywords
within the Tweet to determine the overall mood of the Tweet relative to
the topic. The sentiment analysis will provide a negative, neutral, or
positive numeric sentiment value.
 Output:
The software must output real time data in the form of a simple gauge. In
addition, the software may output a graph of mood trends over time, as
well as additional statistics pertaining to a topic (average sentiment over
all analysis sessions and total number of tweets processed). This output
should be clear and easy to understand.

3.3 Non Functional Requirement

 Availability:
The software will be available at all times on the user’s device, as long as
the device is in proper working order. The functionality of the software
will depend on any external services such as internet access that are
required. If those services are unavailable, the user should be alerted.
 Security:
The software should never disclose any personal information of Twitter
users, and should collect no personal information from its own users.
 Maintainability:
The software should be written clearly and concisely. The code will be
well documented. Particular care will be taken to design the software
modularly to ensure that maintenance is easy.
 Portability: This software will be designed to run on any Python version
2.7 or higher. The software will be forward compatible for all currently
released Python versions.

3.4 Hardware and Performance Requirement

Real-Time

The software will provide up-to-date information, limited only by the rate of Twitter
input. The gauge output should display the latest results at all times, and if it lags
behind, the user should be notified.
CHAPTER 4: RESULTS AND FUTURE WORK

4.1 Results
Positive tweets percentage: 31.746031746031747 %
Negative tweets percentage: 28.571428571428573 %
Neutral tweets percentage: 39.682539682539684 %

Positive tweets:
RT @Wyn1745: Trump whistleblower reportedly had ‘professional relationship’ with 2020 Democrat -
Whistleblower should be arrested for trea…
RT @brianstelter: The first excerpt from "All the President's Women: Donald Trump and the Making of a Predator," by Barry Levine and
Moniqu…
@Brooke_Kelly87 @realDonaldTrump Keep America Great. Re-elect President Donald J. Trump.
RT @axios: Lindsey Graham tells @jonathanvswan: "If I hear the president say one more time, 'I made a campaign promise to get out of Syria,
…
RT @maddenifico: "He grabbed me there in the front":

Sexual predator Trump allegedly hid behind a tapestry to sexually assault a woman at…
RT @JoeNBC: Ominous for White House: For the first time, 50% of voters support impeaching and removing Donald Trump from office.
https://t…
RT @joncoopertweets: Donald Trump Promised to Eliminate the Deficit in 8 Years. So Far, He Has Increased it by 68%.
https://t.co/UsINtJh5CP
RT @stuartpstevens: A lot of Christian Kurds are facing death because Donald Trump abandoned them. Will the Trump supporting
Evangelicals s…
RT @Will_Bunch: Good morning. There are now 43 NEW allegations of sexual misconduct against Donald Trump. It’s barely 8 a.m.
https://t.co/B…
RT @DearAuntCrabby: Donald Trump's Minneapolis rally to be met with mass protests: "'America First' is a racist lie"

Trump Logic:

"Let's…

Negative tweets:
RT @SexCounseling: @realDonaldTrump They will play their dirty tricks up until election day. That will not stop the red wave and the re el…
RT @jsolomonReports: Ukraine opened new investigation into Hunter Biden linked firm months before Donald Trump’s call with Ukraine’s
presid…
@realDonaldTrump Turkey said its military will cross the border into northern Syria "shortly." DONALD J TRUMP IS A…
https://t.co/96IwQRDtNe
RT @thebradfordfile: With Nancy Pelosi busy raising cash for Trump’s re-election with her fake impeachment, the president will do his part…
RT @RWPUSA: Harvard Psychologist Says Donald Trump's Claims About Destroying Turkey's Economy Would 'Normally Trigger a Mental
Health Hold'…
RT @nitrogenic: Donald Trump's claim that the military ran out of ammunition is "not true," House Armed Services Committee member says
http…
RT @RWPUSA: Between Russia, Turkey and the United States the situation in Syria is an emolumental mess.

Donald Trump's longtime business c…

Donald Trump is a vile, evil tiny man! https://t.co/y819uDKYcm
RT @charliekirk11: What is liberal privilege?

It's Barack Obama holding a rally in Minneapolis' Target Center and only being charged $20,0…
RT @Strandjunker: @realDonaldTrump Why #ImpeachmentTaskForce?!

Above is a result of sentiment analysis is done on “Donald Trump”. Above figure is showing the percentage of positive,
negative and neutral tweets on “Donald Trump” are 31.74%, 28.57% and 39.68% respectively.
4.2 Future work

Future work for project:

 Using different other models and algorithms.

By using different models and algorithms the efficiency of project can be
improved. Also the performance of project will get better in different
situations.

 Graphical Visualization can be added as a future work in

the project.
By using matplotlib library in python we can show different graphical
visualization like Pie Charts, Bar Graphs, Word Net etc.

 Consideration of Retweets as a factor.

Number of Retweets on a post can be considered while assigning a
polarity to the given post.
REFERENCES

[1] M. Bautin, L. Vijayrenu L, and Skenia., “International sentiment analysis for news
and blogs”, In Second International Conference on Weblogs and Social Media
(ICWSM), 2008.

[2] P. Turney., “Thumbs up or thumbs down?” Semantic orientation applied to

unsupervised classiﬁcation of reviews. In Proceedings of the 40th annual meeting of the
Association for Computational Linguistics,pp. 417424, 2002.

[3] B. Pang and L. Lee., “Opinion mining and sentiment analysis”, Foundations and Trends
in Information Retrieval, vol. 2, no. 1-2, pp. 1-135, 2008.

[4] B. Pang and L. Lee., “Using very simple statistics for review search: An exploration”,
In Proceedings of the International C3onference on Computational Linguistics
(COLING), 2008.

[5] K. Dave, S. Lawrence, and D. Pennock., “Mining the peanut gallery: Opinion
extraction and semantic classiﬁcation of product reviews”, pp. 519-528, 2003.

[6] N. Godbole, M. Srinivasaiah, and S. Skiena., “Large-scale sentiment anal-ysis for news
and blogs,” 2007.

[7] A. Kennedy, D. Inkpen,. “Sentiment Classiﬁcation of Movie and Product Reviews

Using Contextual Valence Shifters”, Computational Intelligence, pp.110125, 2006.

[8] J. Kamps, M. Marx, R. Mokken., ”Using WordNet to Measure Semantic Orientation

of Adjectives”, LREC 2004, vol. IV, pp. 11151118, 2004.

[9] V. Hatzivassiloglou, and J. Wiebe., “Eﬀects of Adjective Orientation and Gradability

on Sentence Subjectivity”, Proceedings of the 18th International Conference on
Computational Linguistics, New Brunswick, NJ, 2000.

[10] A. Andreevskaia, S. Bergler, and M. Urseanu, “All Blogs Are Not Made Equal:
Exploring Genre Diﬀerences in Sentiment Tagging of Blogs”, In International
Conference on Weblogs and Social Media (ICWSM-2007), Boulder, CO, 2007.

[11] P. Turney, and M. Littman., “Measuring Praise and Criticism: Inference of Semantic
Orientation from Association”, ACM Transactions on Information Systems.
APPENDIX

Following is the code used while doing Sentiment Analysis:-

import re
import tweepy
from tweepy import OAuthHandler
from textblob import TextBlob

class TwitterClient(object):
'''
Generic Twitter Class for sentiment analysis.
'''
def __init__(self):
'''
Class constructor or initialization method.
'''
# keys and tokens from the Twitter Dev Console
consumer_key = '9ODKKNtTFpvf3Sdf7JxuWMxiJ'
consumer_secret = 'nhIjH1RpNRRnIvLrp5BPmt9Sw26DbeYcg0h6vRnfxkLTsv571V'
access_token = '794095176427913216-mMiuQhWxaMVysL79czYW0tchubNCeYp'
access_token_secret = 'wzt6479cP7gSoGOCjpLriDZyv6k57u91S46VFLfFi0LAL'

# attempt authentication
try:
# create OAuthHandler object
self.auth = OAuthHandler(consumer_key, consumer_secret)
# set access token and secret
self.auth.set_access_token(access_token, access_token_secret)
# create tweepy API object to fetch tweets
self.api = tweepy.API(self.auth)
except:
print("Error: Authentication Failed")

def clean_tweet(self, tweet):

'''
Utility function to clean tweet text by removing links, special characters
using simple regex statements.
'''
return ' '.join(re.sub("(@[A-Za-z0-9]+)|([^0-9A-Za-z \t])|(\w+:\/\/\S+)", " ", tweet).split())

def get_tweet_sentiment(self, tweet):

'''
Utility function to classify sentiment of passed tweet
using textblob's sentiment method
'''
# create TextBlob object of passed tweet text
analysis = TextBlob(self.clean_tweet(tweet))
# set sentiment
if analysis.sentiment.polarity > 0:
return 'positive'
elif analysis.sentiment.polarity == 0:
return 'neutral'
else:
return 'negative'

def get_tweets(self, query, count = 10):

'''
Main function to fetch tweets and parse them.
'''
# empty list to store parsed tweets
tweets = []

try:
# call twitter api to fetch tweets
fetched_tweets = self.api.search(q = query, count = count)

# parsing tweets one by one

for tweet in fetched_tweets:
# empty dictionary to store required params of a tweet
parsed_tweet = {}

# saving text of tweet

parsed_tweet['text'] = tweet.text
# saving sentiment of tweet
parsed_tweet['sentiment'] = self.get_tweet_sentiment(tweet.text)
# appending parsed tweet to tweets list
if tweet.retweet_count > 0:
# if tweet has retweets, ensure that it is appended only once
if parsed_tweet not in tweets:
tweets.append(parsed_tweet)
else:
tweets.append(parsed_tweet)

# return parsed tweets

return tweets

except tweepy.TweepError as e:
# print error (if any)
print("Error : " + str(e))

def main():
# creating object of TwitterClient Class
api = TwitterClient()
# calling function to get tweets
tweets = api.get_tweets(query = 'Donald Trump', count = 200)

# picking positive tweets from tweets

ptweets = [tweet for tweet in tweets if tweet['sentiment'] == 'positive']
# percentage of positive tweets
print("Positive tweets percentage: {} %".format(100*len(ptweets)/len(tweets)))
# picking negative tweets from tweets
ntweets = [tweet for tweet in tweets if tweet['sentiment'] == 'negative']
# percentage of negative tweets
print("Negative tweets percentage: {} %".format(100*len(ntweets)/len(tweets)))
# percentage of neutral tweets
utweet = (100*(len(tweets)-len(ntweets)-len(ptweets))/len(tweets))
print("Neutral tweets percentage: ",utweet,"%")
# printing first 5 positive tweets
print("\n\nPositive tweets:")
for tweet in ptweets[:10]:
print(tweet['text'])

# printing first 5 negative tweets

print("\n\nNegative tweets:")
for tweet in ntweets[:10]:
print(tweet['text'])

if __name__ == "__main__":
# calling main function
main()

Senographe Essential: Operator Manual
No ratings yet
Senographe Essential: Operator Manual
262 pages
Sentiment Analysis of Twitter Data My
75% (4)
Sentiment Analysis of Twitter Data My
14 pages
Sentiment Analysis On Twitter
100% (2)
Sentiment Analysis On Twitter
8 pages
Sentiment Analysis Final Documentation Report
50% (2)
Sentiment Analysis Final Documentation Report
21 pages
Twitter Sentiment Analysis
No ratings yet
Twitter Sentiment Analysis
7 pages
Social Media Sentiment
No ratings yet
Social Media Sentiment
8 pages
Machine Learning For Sentiment Analysis of Twitter Data
No ratings yet
Machine Learning For Sentiment Analysis of Twitter Data
9 pages
Abstract
No ratings yet
Abstract
2 pages
Twitter Sentiment Analysis Using Machine Learning Algorithms IJERTV12IS070128
No ratings yet
Twitter Sentiment Analysis Using Machine Learning Algorithms IJERTV12IS070128
3 pages
Abstract Review PPT Tem - 03
No ratings yet
Abstract Review PPT Tem - 03
7 pages
ProjectFinalReport 2copies
No ratings yet
ProjectFinalReport 2copies
26 pages
FML Project Report
No ratings yet
FML Project Report
18 pages
Batch-6c Minipro Doc Rev-2
No ratings yet
Batch-6c Minipro Doc Rev-2
33 pages
6 Project Report Sem6
No ratings yet
6 Project Report Sem6
13 pages
TSA Synopsis
No ratings yet
TSA Synopsis
18 pages
Sentiment Analysis of Tweets Using Machine Learning
No ratings yet
Sentiment Analysis of Tweets Using Machine Learning
22 pages
finalreview1
No ratings yet
finalreview1
4 pages
Digital Assignment-1 Literature Review On Twitter Sentiment Analysis Name: G.Tirumala Reg No: 16BCE0202 1)
No ratings yet
Digital Assignment-1 Literature Review On Twitter Sentiment Analysis Name: G.Tirumala Reg No: 16BCE0202 1)
9 pages
Depicting The Public Sentiment Variations On Twitter
No ratings yet
Depicting The Public Sentiment Variations On Twitter
3 pages
Introduction
No ratings yet
Introduction
27 pages
Cmu CS QTR 127
No ratings yet
Cmu CS QTR 127
38 pages
Seminar Report 52110
No ratings yet
Seminar Report 52110
26 pages
minor_project_report
No ratings yet
minor_project_report
29 pages
Sentiment Analysis On Twitter Using Streaming Api: Abstract
No ratings yet
Sentiment Analysis On Twitter Using Streaming Api: Abstract
5 pages
Final Twitter - Sentiment - Analysis - Report
100% (1)
Final Twitter - Sentiment - Analysis - Report
14 pages
Twitte Analysis
No ratings yet
Twitte Analysis
53 pages
A Review On Twitter Sentiment Analysis Approaches
No ratings yet
A Review On Twitter Sentiment Analysis Approaches
5 pages
Major Project Report: AT "Baldev Ram Mirdha Institute of Technology"
No ratings yet
Major Project Report: AT "Baldev Ram Mirdha Institute of Technology"
51 pages
Major Project Report: AT "Baldev Ram Mirdha Institute of Technology"
No ratings yet
Major Project Report: AT "Baldev Ram Mirdha Institute of Technology"
51 pages
Major Project Report: AT "Baldev Ram Mirdha Institute of Technology"
No ratings yet
Major Project Report: AT "Baldev Ram Mirdha Institute of Technology"
51 pages
fin_ijprems1714118825
No ratings yet
fin_ijprems1714118825
6 pages
twitter sentiment analysis ppt
100% (2)
twitter sentiment analysis ppt
10 pages
IR Case Study Final Presentation
No ratings yet
IR Case Study Final Presentation
12 pages
Social Media Se
No ratings yet
Social Media Se
3 pages
Twitter Sentiment Analysis by Robin Singh
No ratings yet
Twitter Sentiment Analysis by Robin Singh
57 pages
Sentimental Analysis On Twitter Data Using Naive Bayes: Ijarcce
No ratings yet
Sentimental Analysis On Twitter Data Using Naive Bayes: Ijarcce
4 pages
Twitter Sentimental Analysis
No ratings yet
Twitter Sentimental Analysis
5 pages
Senti bp1
No ratings yet
Senti bp1
2 pages
proposalwriting
No ratings yet
proposalwriting
16 pages
Twitter Sentiment Analysis Research Paper
No ratings yet
Twitter Sentiment Analysis Research Paper
5 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
11 pages
10 1109@icaccs48705 2020 9074208
No ratings yet
10 1109@icaccs48705 2020 9074208
3 pages
(IJIT-V6I4P8) :nikita R. Dandwate, Sarika B. Solanke
No ratings yet
(IJIT-V6I4P8) :nikita R. Dandwate, Sarika B. Solanke
5 pages
Twitter Sentiment Analysis Project Report Compressed
No ratings yet
Twitter Sentiment Analysis Project Report Compressed
33 pages
TOXIC_COMMENT_CLASSIFICATION_SYSTEM_USING_DEEP_LEA
No ratings yet
TOXIC_COMMENT_CLASSIFICATION_SYSTEM_USING_DEEP_LEA
6 pages
Fin Irjmets1715854730
No ratings yet
Fin Irjmets1715854730
8 pages
IJCRT2207068
No ratings yet
IJCRT2207068
5 pages
Freport
No ratings yet
Freport
25 pages
Dataset Analysis Using Keyword Searching in Twitter Data: Inderprastha Engineering College, Ghaziabad
No ratings yet
Dataset Analysis Using Keyword Searching in Twitter Data: Inderprastha Engineering College, Ghaziabad
4 pages
Sentiment Analysis of Twitter Data
No ratings yet
Sentiment Analysis of Twitter Data
1 page
Project Report
No ratings yet
Project Report
10 pages
Sentiment Analysis For Promotional Campaigns: 1 Sameer Mulani 2 Nikhat Pathan
No ratings yet
Sentiment Analysis For Promotional Campaigns: 1 Sameer Mulani 2 Nikhat Pathan
3 pages
Akshada Tweet Report With Pages Removed
No ratings yet
Akshada Tweet Report With Pages Removed
15 pages
Minor Project Report grp 11 (2)
No ratings yet
Minor Project Report grp 11 (2)
21 pages
Vaibhav DSBDA Project
No ratings yet
Vaibhav DSBDA Project
16 pages
571 Document Mod
No ratings yet
571 Document Mod
30 pages
Sentiment of tweets
No ratings yet
Sentiment of tweets
7 pages
Se Write-Up
No ratings yet
Se Write-Up
2 pages
Vol 7 No 1 - November 2013
No ratings yet
Vol 7 No 1 - November 2013
105 pages
Uno
No ratings yet
Uno
6 pages
Capitalizing Data Science: A Guide to Unlocking the Power of Data for Your Business and Products (English Edition)
From Everand
Capitalizing Data Science: A Guide to Unlocking the Power of Data for Your Business and Products (English Edition)
Mathangi Sri Ramachandran
No ratings yet
IBM Merged
No ratings yet
IBM Merged
13 pages
IVR Service Report
No ratings yet
IVR Service Report
2 pages
SM SEC A Group 07
No ratings yet
SM SEC A Group 07
7 pages
Sales Playbook
No ratings yet
Sales Playbook
12 pages
Group 4 - Patanjali Presentation
No ratings yet
Group 4 - Patanjali Presentation
7 pages
SM-II Group 09
No ratings yet
SM-II Group 09
7 pages
Dr. Ashish Chhabra
No ratings yet
Dr. Ashish Chhabra
1 page
Group 5 Patanjali PDF
No ratings yet
Group 5 Patanjali PDF
8 pages
Dream Big
No ratings yet
Dream Big
5 pages
Tesla Group 1 Section A
No ratings yet
Tesla Group 1 Section A
7 pages
SMII Group 3
No ratings yet
SMII Group 3
7 pages
Should Maruti Suzuki Invest in Electric Cars
No ratings yet
Should Maruti Suzuki Invest in Electric Cars
19 pages
E-Commerce-Ant Financial
No ratings yet
E-Commerce-Ant Financial
11 pages
Aditya 2
No ratings yet
Aditya 2
1 page
BKC Internship Certificate - Daksh Malhotra
No ratings yet
BKC Internship Certificate - Daksh Malhotra
1 page
Electronic Receipt Application Number: D217157: I Accept That Fees Paid Is Non Refundable
No ratings yet
Electronic Receipt Application Number: D217157: I Accept That Fees Paid Is Non Refundable
1 page
Imi New Delhi Only / Imi New Delhi + Imi Kolkata And/Or Imi Bhubaneswar
No ratings yet
Imi New Delhi Only / Imi New Delhi + Imi Kolkata And/Or Imi Bhubaneswar
3 pages
Zebra Ze500 Series Print Engines Maintenance Manual
No ratings yet
Zebra Ze500 Series Print Engines Maintenance Manual
892 pages
Symbianize: Forum Guidelines
No ratings yet
Symbianize: Forum Guidelines
20 pages
Stqa Unit I
No ratings yet
Stqa Unit I
15 pages
Mira Al Ghfeli Resume 2024
No ratings yet
Mira Al Ghfeli Resume 2024
2 pages
108-NDR-Network-Detection-and-Response-ATA-2024
No ratings yet
108-NDR-Network-Detection-and-Response-ATA-2024
44 pages
WikiStart - OsmocomBB - Open Source Mobile Communications
No ratings yet
WikiStart - OsmocomBB - Open Source Mobile Communications
3 pages
CSE2045Y - Lecture 9 - AJAX and XML DOM
No ratings yet
CSE2045Y - Lecture 9 - AJAX and XML DOM
25 pages
Service Bulletin: JET Is A Registered Trademark of L-3 Avionics Systems, Inc
No ratings yet
Service Bulletin: JET Is A Registered Trademark of L-3 Avionics Systems, Inc
5 pages
" A Puzzle A Day To Learn, Code, and Play " Visit: Description Example
No ratings yet
" A Puzzle A Day To Learn, Code, and Play " Visit: Description Example
1 page
Hoja Tecnica - ADATA SE770G
No ratings yet
Hoja Tecnica - ADATA SE770G
2 pages
HTTPS:WWW - Adobe.com:support:products:enterprise:knowledgecenter:media:c4611 Sample Explain
No ratings yet
HTTPS:WWW - Adobe.com:support:products:enterprise:knowledgecenter:media:c4611 Sample Explain
4 pages
Bhuvaneshwari Panchakam Bhuvaneshwari Pratah Smaranam Oriya PDF File12657
No ratings yet
Bhuvaneshwari Panchakam Bhuvaneshwari Pratah Smaranam Oriya PDF File12657
3 pages
SXAXS
No ratings yet
SXAXS
5 pages
Horizon™ 7600: Installation and User's Guide
No ratings yet
Horizon™ 7600: Installation and User's Guide
57 pages
Getting The Most From The Excel Solver
No ratings yet
Getting The Most From The Excel Solver
49 pages
Sets _ DPP 01 __ Uday 2026 (Class 11th)
No ratings yet
Sets _ DPP 01 __ Uday 2026 (Class 11th)
5 pages
Qbasic
No ratings yet
Qbasic
6 pages
Liquid Crystal Display Television Service Manual: Chassis MST6E182VS
No ratings yet
Liquid Crystal Display Television Service Manual: Chassis MST6E182VS
42 pages
Brainchild VR18 Manual
No ratings yet
Brainchild VR18 Manual
96 pages
About The Pallet Design System - Palletdesignsystem
No ratings yet
About The Pallet Design System - Palletdesignsystem
16 pages
IPC Global: Firmware Version 2.04
No ratings yet
IPC Global: Firmware Version 2.04
16 pages
Arya Report
No ratings yet
Arya Report
29 pages
22AB 3-1 Lab External Schedule Nov'24 (CSE, CSC & CSO) - 1
No ratings yet
22AB 3-1 Lab External Schedule Nov'24 (CSE, CSC & CSO) - 1
1 page
Creating A DIY Variometer For A Paraglider Using An Arduino Mini
No ratings yet
Creating A DIY Variometer For A Paraglider Using An Arduino Mini
5 pages
Hassan Zulfiqar Haider 0323414090
No ratings yet
Hassan Zulfiqar Haider 0323414090
7 pages
Whirlpool Washer Diagnostic Mode
No ratings yet
Whirlpool Washer Diagnostic Mode
7 pages
Frameview User Guide 1 1 Web
No ratings yet
Frameview User Guide 1 1 Web
30 pages
Flexible Operating System Internals
100% (1)
Flexible Operating System Internals
362 pages
How To Install Lightroom Presets
No ratings yet
How To Install Lightroom Presets
6 pages

Twitter Sentiment Analysis - Final - Report Copy Sahil

Uploaded by

Twitter Sentiment Analysis - Final - Report Copy Sahil

Uploaded by

Twitter Sentiment Analysis

INDUSTRIAL TRAINING PROJECT REPORT

Submitted in partial fulfilment of the requirements for the Degree

DEPARTMENT OF INFORMATION TECHNOLGY

Date: Signature of the Candidate

Title Page No.

CHAPTER 2 OVERALL DESCRIPTION

2.1 Project Perspective...........................................................................13

CHAPTER 3 SYSTEM REQUIREMENTS

CHAPTER 4 CONCLUSION AND FUTURE WORK

Figure 1: Proposed System of Sentiment Analysis…........................................................9

Figure 2: Keys and Tokens from Twitter Dev Console...................................................14

Figure 3: Procedural flow chart of Sentiment Analysis...................................................15

Figure 4: Architecture Diagram of Sentiment Analysis...................................................16

Figure 1: Proposed System

Given a message, classify whether the message is of positive, negative, or neutral

Does Twitter’s additional user information improve the classiﬁcation accuracy?

In the data analysis, the performance of a classiﬁcation algorithm is often domain

Sentiment analysis of public is highly critical in macro-scale socioeconomic phenomena

 Consumers can use sentiment analysis to research products or services before

What is sentiment analysis?

Sentiment Analysis is the process of ‘computationally’ determining whether a piece of

Why sentiment analysis?

▪ Business: In marketing field companies use it to develop their strategies, to understand

▪ Politics: In political field, it is used to keep track of political view, to detect

2.1 Project Perspective

2.2 Project Functions

Install it using following pip command:

pip install tweepy

 TextBlob: textblob is the python library for processing textual data.

Install it using following pip command:

pip install textblob

Keys and tokens

Consumer API keys: -

nhIjH1RpNRRnIvLrp5BPmt9Sw26DbeYcg0h6vRnfxkLTsv571V (API secret key)

Access token & access token secret: -

wzt6479cP7gSoGOCjpLriDZyv6k57u91S46VFLfFi0LAL (Access token secret)

Figure 2: Procedural flow chart of Sentiment Analysis

The key challenges for sentiment analysis are: -

 Twitter - abbreviations, lack of capitals, poor spelling, poor punctuation, poor

The following Assumptions are:

 It assumes that the collected data is overall overview of people on Twitter.

 It considers all emoticons as special characters having no sentiment.

 It does not consider grammar or structure of tweet as it tokenizes

3.1 External Interface Requirement

3.1.2 Software Interface

3.2 Functional Requirement

o Keywords will be entered by the user for each topic.

3.3 Non Functional Requirement

3.4 Hardware and Performance Requirement

Donald Trump's longtime business c…

Future work for project:

 Using different other models and algorithms.

 Graphical Visualization can be added as a future work in

 Consideration of Retweets as a factor.

[2] P. Turney., “Thumbs up or thumbs down?” Semantic orientation applied to

[7] A. Kennedy, D. Inkpen,. “Sentiment Classiﬁcation of Movie and Product Reviews

[8] J. Kamps, M. Marx, R. Mokken., ”Using WordNet to Measure Semantic Orientation

[9] V. Hatzivassiloglou, and J. Wiebe., “Eﬀects of Adjective Orientation and Gradability

Following is the code used while doing Sentiment Analysis:-

def clean_tweet(self, tweet):

def get_tweet_sentiment(self, tweet):

def get_tweets(self, query, count = 10):

# parsing tweets one by one

# saving text of tweet

# return parsed tweets

# picking positive tweets from tweets

# printing first 5 negative tweets

You might also like