ps1 Notes

The document outlines two primary applications of AI: suggesting messages through text generation and content moderation using sentiment analysis and NLP. It details the processes involved in each application, including model training, contextual understanding, and offensive language detection techniques. Additionally, it discusses the use of various models and algorithms for enhancing message suggestions and moderating content effectively.

Two ways we're using AI:

1. Suggesting Messages (Text Generation)
2. Content Moderation (Sentiment Analysis & NLP)

Suggesting Messages
Context Awareness
Natural Language Processing (NLP) models like BERT and GPT enhance
contextual understanding, which helps reduce false positives. Steps (a code sketch follows the list):
● Preprocessing: Tokenize and clean the text. Convert text into subword
tokens if using BERT/GPT.
● Model Loading: Load a pre-trained BERT or GPT model fine-tuned for
harmful content detection.
● Embedding Generation: Convert text into contextual embeddings that
capture semantic meaning.
● Contextual Understanding: Leverage attention mechanisms to understand
relationships between words and phrases in the text.
● Prediction: Pass embeddings through classification layers to determine
harmfulness.
● Fine-Tuning: Update the model using labeled examples specific to the
application.
● Deployment: Integrate the model into the messaging system for live analysis.
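
A minimal sketch of the loading and prediction steps above, using the Hugging Face Transformers library. The checkpoint name "unitary/toxic-bert" is an assumption here, standing in for whatever BERT model has been fine-tuned for harmful content detection.

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

# Model Loading: placeholder checkpoint; swap in your own fine-tuned model.
model_name = "unitary/toxic-bert"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

def is_harmful(text: str, threshold: float = 0.5) -> bool:
    # Preprocessing: tokenize and convert the text into subword tokens.
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    # Embedding generation and contextual understanding (attention) happen
    # inside the model's forward pass.
    with torch.no_grad():
        logits = model(**inputs).logits
    # Prediction: classification scores -> probability of harmful content.
    probs = torch.sigmoid(logits)
    return bool(probs.max().item() > threshold)

print(is_harmful("Have a great day!"))  # expected: False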

Also, for the message-suggestion part, keep in mind (a generation sketch follows this list):


● Transformer Architecture: Both models rely on self-attention mechanisms, which
allow them to capture long-range dependencies in text efficiently. This makes the
models adept at understanding the context of a conversation or a user prompt.
● Autoregressive Decoding: In message suggestion tasks, the models generate
tokens sequentially. The generation of each token depends on the previously
generated tokens and the input prompt. This ensures the output is coherent and
contextually appropriate.
● Prompt Conditioning: The models are conditioned with a well-crafted prompt (e.g.,
“Generate three open-ended questions”). This guides the generation process to align
the output with the desired structure and intent, such as creating engaging social
media messages.
● Tokenization: Before processing, input text is divided into smaller units called
tokens. The model processes these tokens to predict the next token iteratively,
forming a complete response.
● Beam Search or Sampling: For diverse and creative outputs, methods like Top-k
Sampling or Nucleus Sampling are often employed. These introduce randomness by
selecting from the top-ranked tokens at each step, ensuring variability in generated
messages while maintaining relevance.
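
A minimal sketch tying the points above together, assuming GPT-2 via Hugging Face Transformers; any causal language model checkpoint would work the same way.

from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Prompt Conditioning: the prompt steers the structure and intent of the output.
prompt = "Generate three open-ended questions about weekend plans:\n1."

# Tokenization: split the prompt into subword tokens.
inputs = tokenizer(prompt, return_tensors="pt")

# Autoregressive Decoding with Top-k and Nucleus (Top-p) Sampling for
# diverse but contextually relevant suggestions.
outputs = model.generate(
    **inputs,
    max_new_tokens=60,
    do_sample=True,   # sample instead of greedy/beam decoding
    top_k=50,         # Top-k Sampling: keep only the 50 most likely tokens
    top_p=0.9,        # Nucleus Sampling: keep 90% of the probability mass
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
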
Content Moderation
Sentiment Analysis
While sentiment analysis typically categorizes text as positive, negative, or neutral, it
can be adapted to detect offensive language by associating specific negative sentiments or
aggressive tones with harmful content. The model might be fine-tuned on datasets where
aggressive or hateful speech is labeled as "negative sentiment," triggering the detection of
offensive language.
● Lexicon-Based Approaches: In these approaches, a predefined dictionary (or
lexicon) of words associated with specific sentiments (e.g., positive, negative,
neutral) is used. Sentiment scores are calculated based on the presence of these
words in the text. Examples include (a VADER sketch follows this list):
○ VADER (Valence Aware Dictionary and sEntiment Reasoner): It is a
lexicon and rule-based sentiment analysis tool specifically tailored for social
media text. VADER detects not just positive or negative sentiment but also the
intensity and polarity of emotions, which can be useful in detecting offensive
or aggressive content.
○ SentiWordNet: A lexical resource for sentiment analysis that assigns
sentiment scores to words. This can be used to identify negative emotions or
offensive language.
● Rule-Based Sentiment Analysis: Rules are created to identify sentiment based on
patterns in text. For example, the presence of certain words (e.g., "hate," "violence,"
"abuse") might trigger a "negative" sentiment label. This approach can be useful in
detecting offensive or hateful language.
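
A minimal sketch of the lexicon-based route using VADER from NLTK. The -0.5 compound-score threshold is an illustrative assumption, not a standard cutoff.

import nltk
nltk.download("vader_lexicon", quiet=True)  # one-time lexicon download
from nltk.sentiment import SentimentIntensityAnalyzer

analyzer = SentimentIntensityAnalyzer()

def flag_if_offensive(text: str, threshold: float = -0.5) -> bool:
    # VADER returns a compound score in [-1, 1]; strongly negative scores can
    # be treated as a signal of aggressive or hateful content.
    scores = analyzer.polarity_scores(text)
    return scores["compound"] <= threshold

print(flag_if_offensive("I hate you and everything you stand for"))  # likely True
print(flag_if_offensive("Have a great day!"))                        # False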

Offensive Language Blocking


● Binary Classification: This is the most common technique, where the model is
trained to classify text as either "offensive" or "non-offensive." The training data
consists of labeled examples, where each text instance is tagged with one of these
two labels. Popular algorithms for text classification include (a sketch follows this list):
○ Logistic Regression
○ Naive Bayes
○ Support Vector Machines (SVM)
○ Neural Networks (especially deep learning models like CNNs and RNNs)
● LLM (Large Language Model) and LSTM (Long Short-Term Memory):
○ LLMs like GPT (Generative Pre-trained Transformer) or BERT
(Bidirectional Encoder Representations from Transformers) are
pre-trained on massive text corpora and can be fine-tuned for specific tasks
such as harmful content detection. These models excel in understanding
context, nuances, and semantics.
○ LSTMs are a type of Recurrent Neural Network (RNN) designed to handle
sequential data effectively, such as sentences. They remember long-term
dependencies, making them suitable for analyzing the sequence and context
of words in a message.
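
A minimal sketch of the binary-classification route with scikit-learn, pairing TF-IDF features with Logistic Regression (one of the algorithms listed above). The four inline examples are toy data, assumed only for illustration; the LLM route is sketched earlier in the Context Awareness section.

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy labeled examples: 1 = offensive, 0 = non-offensive.
texts = [
    "I hate you, you are worthless",
    "You people are disgusting",
    "Thanks for the help, much appreciated",
    "What a lovely photo!",
]
labels = [1, 1, 0, 0]

# TF-IDF turns each message into word/bigram weights; Logistic Regression
# learns a decision boundary between the two labels.
clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
clf.fit(texts, labels)

print(clf.predict(["you are disgusting"]))  # expected: [1]
print(clf.predict(["lovely, thanks!"]))     # expected: [0]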
