2
Most read
5
Most read
13
Most read
How Sentiment Analysis works in machines (an introduction) Marie-Claire Jenkins                     
What is it? It's software for automatically extracting opinions, emotions and sentiments in text. It allows us to track attitudes and feelings on the web. People write blog posts, comments, reviews and tweets about all sorts of different topics.  We can track products, brands and people for example and determine whether they are viewed positively or negatively on the web. 
We can analyse...                        acts: "The painting was more expensive than a Monet"                      pinions: "I honestly don't like Monet, Pollock is the better artist"
Why would we want to do this? It allows business to track:          - Flame detection (bad rants)          - New product perception          - Brand perception          - Reputation management          It allows individuals to get:      - An opinion on something (reviews) on a global scale     
Several fields of computing merge Natural language processing It deals with the actual text element. It transforms it into a format that the machine can use. Artificial intelligence It uses the information given by the NLP and uses a lot of maths to determine whether something is negative or positive: it is used for clustering.
The problem has several dimensions 1 - How does a machine define subjectivity & sentiment? 2 - How does a machine analyse polaraity (negative/positive)? 3 - How does a machine deal with subjective word senses? 4 - How does a machine assign an opinion rating? 5 - How does a machine know about sentiment intensity?
What is an opinion? " a personal belief or judgment that is not founded on proof or certainty" (WordNet) But: “ The fact that an opinion has been widely held is no evidence whatever that it is not utterly absurd.” (Bertrand Russell) Word of mouth is powerful though...
It's not always easy to differentiate between fact and opinion.
What is an opinion to a machine? It is a "quintuple", an object made up of 5 different things: O j  = The thing in question (i.e product)  f  jk  = a feature of O j SO  ijkl  = the sentiment value of the opinion of the opinion holder h i  on feature f jk  of object o j  at time t l These 5 elements have to be identified by the machine {defined by Bing Liu in the NLP handbook}
The Quintuple is hard to resolve O j  = Named Entity Extraction f  jk  = Information extraction SO  ijkl  = Sentiment analysis h i  = Information extraction T i  = Data extraction All of these problems are as yet unsolved in computer science {See Bing Liu}
Language is ambiguous Consider: " The watch isn't water resistant " - In a product review this could be negative. " As much use as a trapdoor on a lifeboat " - negative but not obvious to the machine. " The canon camera is better than the Fisher Price one " - comparisons are hard to classify. " imo the ice cream is luuurrrrrrvely " - slang and the way we communicate in general needs to be processed.
The process... 1 - Part-of-speech tagging (but also position and more): The word in the text (or the sentence) at tagged using a POS-tagger so that it assigns a label to each word, allowing the machine to do something with it. It looks something like this: S = subject VP = Verb Phrase V = Verb N = Noun NP = Noun Phrase PP = Preposition Det = Determiner Then we extract defined patterns like [Det] + [NN] for example
The process part 2 We look at sentiment orientation (SO) of the patterns we extracted. For example we may have extracted: Amazing + Phone which is:  [JJ] + [NN]  (or adjective followed by noun in human) The opposite might be "Terrible" for example. In this stage, the machines tries to situate the words on an emotive scale (so to speak).
The process part 3 The average Sentiment orientation of all the phrases we gathered is computed. This allows the machine to say something like: "Generally people like the new iphone" --> They recommend it or "Generally people hate the new iphone" --> They don't recommend it
Classifying sentiments This is very difficult but experiments have been done using:    -  Naive Bayes  (probabilistic classifier using Bayes theorem)                             -  Maximum Entropy  (Uses  probability distributions on the basis of partial knowledge )    -  Support vector machine  (Data is set as 2 vectors in an n-dimensional space) Pang et al.  found the SVM to be the most accurate classifier (around 80%). There are other methods being explored as well.
So does it work? The wider your throw the net and the more complex the language, the less accurate the system will be. This is simply due to the level of complexity it has to deal with. If you want to classifiy sentiments into +/- groups, then you are more likely to get a good result than if you are trying to classify into more exact groups (Excellent, incredible, good...). More granularity requires more accuracy and this in turn requires a deeper understanding of human language.  There are commercial systems in place at this time and also systems like  NaCTeM  in the research space.
Things to read Sentiment analysis in Text  (SFS) Opinion mining and sentiment analysis  (Bo Pang, Lillian Lee) Opinion Extraction, Summarization and Tracking in News and Blog Corpora   (Ku, Liang, Chen) Sentiment analysis and subjectivity  (Bing Liu) International sentiment analysis for news and blogs  (Bautin) Sentiment analysis: does coreference matter?  (Nikolov) CIKM Workshop on sentiment analysis
In the press Google and sentiment analysis  (SeoByTheSea) 5 ways sentiment analysis is ramping up in 2009  (RWW) Mining the web for feelings not facts  (NY Times) Is sentiment analysis reliable?  (Marketing Pilgrim) Sentimental searching  (Watching the watchers)
Tools for coders SentiWordNet LingPipe sentiment analysis Long list of tools at CodeSpeak The Toolkit for Advanced Discriminative Modeling (TADM) RapidMiner
Check out this beautiful visual sentiment system <3 
http://www.scienceforseo.com

More Related Content

PPTX
Sentiment Analysis
PDF
Sentiment Analysis
PDF
Introduction to Sentiment Analysis
PDF
Sentiment analysis - Our approach and use cases
PPTX
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
PDF
Text classification & sentiment analysis
PPTX
Approaches to Sentiment Analysis
PDF
Sentiment analysis of Twitter Data
Sentiment Analysis
Sentiment Analysis
Introduction to Sentiment Analysis
Sentiment analysis - Our approach and use cases
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
Text classification & sentiment analysis
Approaches to Sentiment Analysis
Sentiment analysis of Twitter Data

What's hot (20)

PPTX
Sentiment Analysis
PPTX
Sentiment analysis of Twitter data using python
PPTX
Sentiment analysis
PPTX
Social Media Sentiments Analysis
PPTX
Sentiment analysis
PPTX
Sentiment Analaysis on Twitter
PPTX
Sentiment Analysis in Twitter
PPTX
Sentiment analysis presentation
PPTX
Twitter sentiment analysis ppt
PPTX
Sentiment analysis
PDF
Amazon sentimental analysis
PPTX
Twitter sentiment analysis
PPTX
New sentiment analysis of tweets using python by Ravi kumar
PPTX
Twitter sentiment analysis
PDF
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
PDF
Twitter Sentiment Analysis.pdf
PPT
Social Media Sentiment Analysis
PPTX
Presentation on Sentiment Analysis
PPTX
Sentiment analysis of twitter data
PPTX
Twitter sentiment analysis ppt
Sentiment Analysis
Sentiment analysis of Twitter data using python
Sentiment analysis
Social Media Sentiments Analysis
Sentiment analysis
Sentiment Analaysis on Twitter
Sentiment Analysis in Twitter
Sentiment analysis presentation
Twitter sentiment analysis ppt
Sentiment analysis
Amazon sentimental analysis
Twitter sentiment analysis
New sentiment analysis of tweets using python by Ravi kumar
Twitter sentiment analysis
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Twitter Sentiment Analysis.pdf
Social Media Sentiment Analysis
Presentation on Sentiment Analysis
Sentiment analysis of twitter data
Twitter sentiment analysis ppt
Ad

Similar to How Sentiment Analysis works (20)

PDF
A Survey On Sentiment Analysis And Opinion Mining Techniques
PDF
A Survey on Sentiment Analysis and Opinion Mining.pdf
PPT
Opinion Mining
PPSX
Sentiment analysis
PDF
Supervised Approach to Extract Sentiments from Unstructured Text
PPT
Ml ppt
PPT
opinionmining-131221011849-phpapp02-converted.ppt
PPT
An overview of concepts of Sentiment Analysis
PPT
Sentiment+Analysis.ppt
PDF
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
PPT
Major
PPT
sa-mincut-aditya.ppt
PPTX
02 naive bays classifier and sentiment analysis
PDF
Introduction to Sentiment Analysis
PDF
Sentiment Analysis Using Hybrid Approach: A Survey
PPT
ppt on sentiment analysis using various techniques
PDF
project sentiment analysis
PDF
Sentiment Analysis in Hindi Language : A Survey
PDF
Sentiment Analysis using Naïve Bayes, CNN, SVM
A Survey On Sentiment Analysis And Opinion Mining Techniques
A Survey on Sentiment Analysis and Opinion Mining.pdf
Opinion Mining
Sentiment analysis
Supervised Approach to Extract Sentiments from Unstructured Text
Ml ppt
opinionmining-131221011849-phpapp02-converted.ppt
An overview of concepts of Sentiment Analysis
Sentiment+Analysis.ppt
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
Major
sa-mincut-aditya.ppt
02 naive bays classifier and sentiment analysis
Introduction to Sentiment Analysis
Sentiment Analysis Using Hybrid Approach: A Survey
ppt on sentiment analysis using various techniques
project sentiment analysis
Sentiment Analysis in Hindi Language : A Survey
Sentiment Analysis using Naïve Bayes, CNN, SVM
Ad

More from CJ Jenkins (7)

PPTX
I am an experience designer
PPT
Using construction grammar in conversational systems
ODP
Knowledgebase vs Database
ODP
Building a semantic website
ODP
Search Engine Spiders
PPT
Twitter for business
ODP
The search engine index
I am an experience designer
Using construction grammar in conversational systems
Knowledgebase vs Database
Building a semantic website
Search Engine Spiders
Twitter for business
The search engine index

Recently uploaded (20)

PPTX
MuleSoft-Compete-Deck for midddleware integrations
PDF
Convolutional neural network based encoder-decoder for efficient real-time ob...
PDF
4 layer Arch & Reference Arch of IoT.pdf
PDF
Statistics on Ai - sourced from AIPRM.pdf
PDF
Enhancing plagiarism detection using data pre-processing and machine learning...
PDF
Dell Pro Micro: Speed customer interactions, patient processing, and learning...
PDF
Flame analysis and combustion estimation using large language and vision assi...
PPT
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
PDF
sbt 2.0: go big (Scala Days 2025 edition)
PDF
Co-training pseudo-labeling for text classification with support vector machi...
PPTX
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
DOCX
Basics of Cloud Computing - Cloud Ecosystem
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PDF
Early detection and classification of bone marrow changes in lumbar vertebrae...
PDF
Comparative analysis of machine learning models for fake news detection in so...
PPTX
Training Program for knowledge in solar cell and solar industry
PDF
Lung cancer patients survival prediction using outlier detection and optimize...
PDF
Data Virtualization in Action: Scaling APIs and Apps with FME
DOCX
search engine optimization ppt fir known well about this
PDF
sustainability-14-14877-v2.pddhzftheheeeee
MuleSoft-Compete-Deck for midddleware integrations
Convolutional neural network based encoder-decoder for efficient real-time ob...
4 layer Arch & Reference Arch of IoT.pdf
Statistics on Ai - sourced from AIPRM.pdf
Enhancing plagiarism detection using data pre-processing and machine learning...
Dell Pro Micro: Speed customer interactions, patient processing, and learning...
Flame analysis and combustion estimation using large language and vision assi...
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
sbt 2.0: go big (Scala Days 2025 edition)
Co-training pseudo-labeling for text classification with support vector machi...
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
Basics of Cloud Computing - Cloud Ecosystem
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
Early detection and classification of bone marrow changes in lumbar vertebrae...
Comparative analysis of machine learning models for fake news detection in so...
Training Program for knowledge in solar cell and solar industry
Lung cancer patients survival prediction using outlier detection and optimize...
Data Virtualization in Action: Scaling APIs and Apps with FME
search engine optimization ppt fir known well about this
sustainability-14-14877-v2.pddhzftheheeeee

How Sentiment Analysis works

  • 1. How Sentiment Analysis works in machines (an introduction) Marie-Claire Jenkins                     
  • 2. What is it? It's software for automatically extracting opinions, emotions and sentiments in text. It allows us to track attitudes and feelings on the web. People write blog posts, comments, reviews and tweets about all sorts of different topics.  We can track products, brands and people for example and determine whether they are viewed positively or negatively on the web. 
  • 3. We can analyse...                        acts: &quot;The painting was more expensive than a Monet&quot;                      pinions: &quot;I honestly don't like Monet, Pollock is the better artist&quot;
  • 4. Why would we want to do this? It allows business to track:          - Flame detection (bad rants)          - New product perception          - Brand perception          - Reputation management          It allows individuals to get:      - An opinion on something (reviews) on a global scale     
  • 5. Several fields of computing merge Natural language processing It deals with the actual text element. It transforms it into a format that the machine can use. Artificial intelligence It uses the information given by the NLP and uses a lot of maths to determine whether something is negative or positive: it is used for clustering.
  • 6. The problem has several dimensions 1 - How does a machine define subjectivity & sentiment? 2 - How does a machine analyse polaraity (negative/positive)? 3 - How does a machine deal with subjective word senses? 4 - How does a machine assign an opinion rating? 5 - How does a machine know about sentiment intensity?
  • 7. What is an opinion? &quot; a personal belief or judgment that is not founded on proof or certainty&quot; (WordNet) But: “ The fact that an opinion has been widely held is no evidence whatever that it is not utterly absurd.” (Bertrand Russell) Word of mouth is powerful though...
  • 8. It's not always easy to differentiate between fact and opinion.
  • 9. What is an opinion to a machine? It is a &quot;quintuple&quot;, an object made up of 5 different things: O j = The thing in question (i.e product)  f  jk = a feature of O j SO ijkl = the sentiment value of the opinion of the opinion holder h i on feature f jk of object o j at time t l These 5 elements have to be identified by the machine {defined by Bing Liu in the NLP handbook}
  • 10. The Quintuple is hard to resolve O j = Named Entity Extraction f  jk = Information extraction SO  ijkl = Sentiment analysis h i = Information extraction T i = Data extraction All of these problems are as yet unsolved in computer science {See Bing Liu}
  • 11. Language is ambiguous Consider: &quot; The watch isn't water resistant &quot; - In a product review this could be negative. &quot; As much use as a trapdoor on a lifeboat &quot; - negative but not obvious to the machine. &quot; The canon camera is better than the Fisher Price one &quot; - comparisons are hard to classify. &quot; imo the ice cream is luuurrrrrrvely &quot; - slang and the way we communicate in general needs to be processed.
  • 12. The process... 1 - Part-of-speech tagging (but also position and more): The word in the text (or the sentence) at tagged using a POS-tagger so that it assigns a label to each word, allowing the machine to do something with it. It looks something like this: S = subject VP = Verb Phrase V = Verb N = Noun NP = Noun Phrase PP = Preposition Det = Determiner Then we extract defined patterns like [Det] + [NN] for example
  • 13. The process part 2 We look at sentiment orientation (SO) of the patterns we extracted. For example we may have extracted: Amazing + Phone which is:  [JJ] + [NN] (or adjective followed by noun in human) The opposite might be &quot;Terrible&quot; for example. In this stage, the machines tries to situate the words on an emotive scale (so to speak).
  • 14. The process part 3 The average Sentiment orientation of all the phrases we gathered is computed. This allows the machine to say something like: &quot;Generally people like the new iphone&quot; --> They recommend it or &quot;Generally people hate the new iphone&quot; --> They don't recommend it
  • 15. Classifying sentiments This is very difficult but experiments have been done using:    - Naive Bayes  (probabilistic classifier using Bayes theorem)                             - Maximum Entropy  (Uses  probability distributions on the basis of partial knowledge )    - Support vector machine (Data is set as 2 vectors in an n-dimensional space) Pang et al. found the SVM to be the most accurate classifier (around 80%). There are other methods being explored as well.
  • 16. So does it work? The wider your throw the net and the more complex the language, the less accurate the system will be. This is simply due to the level of complexity it has to deal with. If you want to classifiy sentiments into +/- groups, then you are more likely to get a good result than if you are trying to classify into more exact groups (Excellent, incredible, good...). More granularity requires more accuracy and this in turn requires a deeper understanding of human language.  There are commercial systems in place at this time and also systems like NaCTeM  in the research space.
  • 17. Things to read Sentiment analysis in Text (SFS) Opinion mining and sentiment analysis (Bo Pang, Lillian Lee) Opinion Extraction, Summarization and Tracking in News and Blog Corpora (Ku, Liang, Chen) Sentiment analysis and subjectivity (Bing Liu) International sentiment analysis for news and blogs (Bautin) Sentiment analysis: does coreference matter? (Nikolov) CIKM Workshop on sentiment analysis
  • 18. In the press Google and sentiment analysis (SeoByTheSea) 5 ways sentiment analysis is ramping up in 2009 (RWW) Mining the web for feelings not facts (NY Times) Is sentiment analysis reliable? (Marketing Pilgrim) Sentimental searching (Watching the watchers)
  • 19. Tools for coders SentiWordNet LingPipe sentiment analysis Long list of tools at CodeSpeak The Toolkit for Advanced Discriminative Modeling (TADM) RapidMiner
  • 20. Check out this beautiful visual sentiment system <3