0% found this document useful (0 votes)
11 views

1 Introduction

Uploaded by

ameywani2012
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views

1 Introduction

Uploaded by

ameywani2012
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 99

Introducing

NLP
Concept
History
Applications
Approaches
Language Challenges
ChatGPT vs Human
Interaction using NLP

 Human talks to the machine


 Machine captures the audio
 Audio to text conversion takes place
 Processing of text data
 Data to audio conversion takes place
 Machine responds to human by playing audio file
Branch of Computer Science & AI that gives computers the ability
to understand text and spoken words in the same way as human
beings can.
Constituents of NLP
Natural Language Understanding

 Lexical Ambiguity
 He is looking for a match
 The fish is ready to eat
 Syntatical Ambiguity
 They are cooking apples
NLP pipeline (pre-processing)
NLP pipeline
Segmentation
Tokenization
Removing Stop Words
Stemming
Lemmatization
Part of Speech Tagging
Named Entity Recognition
Information Retrieval vs Information Extraction
Information Retrieval vs Information Extraction
Phases of NLP
Phases of NLP
Lexical Analysis

Lexicon of a language – collection of words & phrases

Divide text – paragraphs, sentence, words – lexicon normalization required

Lexicon normalization – Stemming, Lemmatization

Stemming – reduce derived words to root form like ing, ly, es, s

Lemmatization – Reduce to lemma or dictionary form


Syntactic Analysis

Used to check grammar, interrelationship between words

“Mumbai goes to Sara” rejected by Syntactic analyzer

Dependency Grammar and POS used


Semantic Analysis

Retrieve meaningful insights from text

Process of extracting meaning from text

Combines machine learning and natural language processing to find the


text’s main ideas and connections
Utilize pre-established ontologies and structured databases of concepts and
relationships in a particular subject
Where does Semantic Analysis work?
Discourse Integration

Understand sense of context – use of proper nouns / pronouns

Ram wants it

“it” keyword does not make sense

Depends on previous sentence


Pragmatic Analysis

Study of meaning – extract insight from text

Understand context, who said to whom

“it” keyword does not make sense

Depends on previous sentence

"What do you want to eat?" and another responded, "Ice cream is good
this time of year."
What is NLP?
What is
NLP?
Applications of NLP
Applications of NLP

 Alexa – understand words, OK Google, SIRI


 We say Play a song
 Speech – text – text – speech

 Word processing
 Language translation
 Grammarly for correction of grammar
 Interactive voice response in call centres
Question

 What is the main challenge of NLP?


 a)Handling Ambiguity of Sentences
b) Handling Tokenization
c) Handling POS-Tagging
d) All of the mentioned
Answer

 What is the main challenge of NLP?


 a)Handling Ambiguity of Sentences
b) Handling Tokenization
c) Handling POS-Tagging
d) All of the mentioned
Question

 Choose form the following areas where NLP can be useful.


a) Automatic Text Summarization
b) Automatic Question-Answering Systems
c) Information Retrieval
d) All of the mentioned
Answer

 Choose form the following areas where NLP can be useful.


a) Automatic Text Summarization
b) Automatic Question-Answering Systems
c) Information Retrieval
d) All of the mentioned
History of NLP
History of NLP
Chatbots

JABBER Dr. SmartC


ELIZA PARRY A.L.I.C.E
WACKY Sbaitso hild
1966 1972 1995
1988 1992 2001

Google
SIRI Cortana ALEXA ChatGPT
Assistant
2010 2014 2014 2021
2017
Why Tokenization?

 Text Preprocessing : remove unnecessary words


 Feature extraction : token used as a feature
 Word-level analysis : POS, NER
 Text Representation : using bag-of-words, word embeddings
 Language Processing : Parsing, Stemming, Lemmatization
 Efficient storage and retrieval : reduce dimensionality
 Statistical Analysis: Calculate word frequency to extract
information
Stemming
Stemming
Stemming
Stemming
Morphology
Morphology
Morphology
Types of Morpheme
Free Morpheme
Lexical Morpheme
Lexical Morpheme
Functional Morpheme
Functional Morpheme
Bound Morpheme
Bound Morpheme
Bound Morpheme
Bound Morpheme
Derivational Morpheme
Derivational Morpheme
Derivational Morpheme
Derivational Morpheme
Inflexional Morpheme
Inflexional Morpheme
Morpheme
Classify Morpheme – Free, Bound,
Inflexional, Derivational
Token Morpheme Type
Happy

Book

Ness in happiness

Ed in walked
Classify Morpheme – Free, Bound

Token Morpheme Type


Happy Free - lexical

Book Free - lexical

Ness in happiness Bound - derivational


Ed in walked Bound - inflexional
Language Challenges
Main Approaches in NLP

 Rule Based Methods


▪ Regular Expressions
▪ Context-free grammar
 Probablistic modeling & Machine Learning
▪ Likelihood maximization
▪ Linear classifiers
 Deep Learning
▪ Recurrent Neural Network
▪ Convolutional Neural Network
Main Approaches in NLP

 Rule Based Methods


▪ Regular Expressions
▪ Context-free grammar
 Probablistic modeling & Machine Learning
▪ Likelihood maximization
▪ Linear classifiers
 Deep Learning
▪ Recurrent Neural Network
▪ Convolutional Neural Network
Main Approaches in NLP

 Rule Based Methods


▪ Regular Expressions
▪ Context-free grammar
 Probablistic modeling & Machine Learning
▪ Likelihood maximization
▪ Linear classifiers
 Deep Learning
▪ Recurrent Neural Network
▪ Convolutional Neural Network
Main Approaches in NLP

 Rule Based Methods


▪ Regular Expressions
▪ Context-free grammar
 Probablistic modeling & Machine Learning
▪ Likelihood maximization
▪ Linear classifiers
 Deep Learning
▪ Recurrent Neural Network
▪ Convolutional Neural Network

You might also like