0% found this document useful (0 votes)
43 views15 pages

Natural Language Processing - Back Exercises

Uploaded by

vibhansh.garg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views15 pages

Natural Language Processing - Back Exercises

Uploaded by

vibhansh.garg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Back Exercises

Cognitive Behavioural Therapy

Auto Text Summarisation

Script-Bot

Conversational Agents

Auto Text Summarisation

Text Classification

Extractive Summarisation Abstractive Summarisation

Programming language
syntax

Corpus

True

False
False

True

False

False

False
True

True

False
Smart-Bot

Sentiment Analysis

Humans Language

token

Stemming

Bag of Words

Tokenisation
TFIDF

Text Normalisation

Lemmatization

False
False
False
False

True
True
False
True
False
False
D. Short answer type questions

1. Give in brief the steps involved in the BOW algorithm


The Steps involved in Bag of Words algorithm are:
• Text Normalisation: The collection of data is processed to get normalised corpus.
• Create Dictionary: This step will create a list of all unique words available in normalized corpus.
• Create Document Vectors: For each document in the corpus, create a list of unique words with its
number of occurrences.
• Create Document Vectors for all the Documents: Repeat Step 3 for all documents in the corpus to
create a “Document Vector Table”.

2. What are Chatbots? Name a few


Ans. Chatbot can be defined as an application that automates your tasks like saying good morning when
you wake up, telling you news on a daily basis, helping you in choosing a less traffic route for your school,
ordering a coffee for you on your way back home. Mitsuku bot, Haptik, Ochatbot etc.

3. Give two important applications of Chatbots.


• It helping you in choosing a less traffic route for your school.
• It ordering a coffee for you on your way back home.

4. What is Automatic Text Summarization?


Ans. Automatic Text Summarization is the process of creating the most meaningful and relevant
summary of voluminous texts from multiple resources.
E. Long answer type questions
1. Differentiate between the following
a. Document Frequency and Inverse Document Frequency
Ans. Document Frequency is the number of documents in which the word occurs irrespective of how many
times it has occurred in those documents. On the other hand, Inverse Document Frequency is obtained
when document frequency is in the denominator and the total number of documents is the numerator.
b. BoW and TFIDF
Ans. Bag of Words is a simple and important technique used in Natural Language Processing for extracting
features from the textual data.
On the other hand, Term Frequency and Inverse Document Frequency (TFIDF) method is considered better
than the Bag of Words algorithm because BoW gives the numeric vector of each word in the document but
TFIDF through its numeric value gives us the importance of each word in the document.
c. Extractive Summarization and Abstractive Summarization
Ans. Extractive summarization: In this the selected text, phrases, sentences or sections are picked up from
the scattered resources and joined appropriately to form a concise summary.
Abstractive Summarization: In this, the summary is created by interpreting the text from multiple
resources using advanced NLP techniques. This new summary may or may not have text, phrases or
sentences from the original documents.
d. Human Language and Computer Language
Ans Humans Language is the language used by humans to interact with the people around them. Whereas,
Computer Language is a language used by the programmers to develop a computer program which helps
humans to interact with an electronic device-computer.
E. Long answer type questions
2. What is Data Acquisition in AI Project Cycle?
Ans. Data Acquisition means collecting raw data for the purpose of reference or analysis for the project.
This is the second stage of the AI project cycle. It is the process of collecting data required for training the
AI project. Data is raw information that is used to generate meaningful outcomes.

3. Explain any two applications of TFIDF


Ans. Some of the important applications of TFIDF are:
• Document Classification: It helps in the classification of the documents scattered in the internet based on
their types, genre, etc.
• Topic Modelling: It helps in predicting the topic of the corpus.

You might also like