Back Exercises
Cognitive Behavioural Therapy
Auto Text Summarisation
Script-Bot
Conversational Agents
Auto Text Summarisation
Text Classification
Extractive Summarisation, Abstractive Summarisation
Programming language
syntax
Corpus
True
False
False
True
False
False
False
True
True
False
Smart-Bot
Sentiment Analysis
Human Language
token
Stemming
Bag of Words
Tokenisation
TFIDF
Text Normalisation
Lemmatization
False
False
False
False
True
True
False
True
False
False
D. Short answer type questions
1. Give in brief the steps involved in the BOW algorithm
Ans. The steps involved in the Bag of Words algorithm are:
• Text Normalisation: The collected data is pre-processed to obtain a normalised corpus.
• Create Dictionary: Make a list of all the unique words that occur in the normalised corpus.
• Create Document Vectors: For each document in the corpus, record how many times each word from the
dictionary occurs in it.
• Create Document Vectors for all the Documents: Repeat the previous step for every document in the
corpus to create a “Document Vector Table”.
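The steps above can be sketched in a few lines of Python. This is only an illustrative sketch: the function names (normalise, bag_of_words) and the two sample sentences are made up for this example, and the normalisation is deliberately simplified to lowercasing and stripping punctuation (no stopword removal, stemming or lemmatisation).

```python
from collections import Counter

def normalise(text):
    # Simplified text normalisation (an assumption for this sketch):
    # split on whitespace, strip surrounding punctuation, lowercase.
    tokens = [word.strip(".,!?;:\"'()").lower() for word in text.split()]
    return [token for token in tokens if token]

def bag_of_words(documents):
    # Step 1: normalise every document in the corpus.
    normalised = [normalise(doc) for doc in documents]

    # Step 2: build the dictionary of unique words, in first-seen order.
    dictionary = []
    for tokens in normalised:
        for token in tokens:
            if token not in dictionary:
                dictionary.append(token)

    # Steps 3 and 4: one document vector per document, which together
    # form the document vector table.
    table = []
    for tokens in normalised:
        counts = Counter(tokens)
        table.append([counts[word] for word in dictionary])
    return dictionary, table

# Hypothetical two-document corpus.
docs = ["The cat sat.", "The dog sat on the cat!"]
dictionary, table = bag_of_words(docs)
print(dictionary)
for row in table:
    print(row)
```

Each row of the table lines up with the dictionary, so the first number in a row is the count of the first dictionary word in that document, and so on.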
2. What are Chatbots? Name a few
Ans. A chatbot can be defined as an application that automates tasks for you, such as wishing you good
morning when you wake up, telling you the news every day, helping you choose a route to school with less
traffic, or ordering a coffee for you on your way back home. Examples include Mitsuku Bot, Haptik and Ochatbot.
3. Give two important applications of Chatbots.
• Helping you choose a route to school with less traffic.
• Ordering a coffee for you on your way back home.
4. What is Automatic Text Summarization?
Ans. Automatic Text Summarization is the process of creating the most meaningful and relevant
summary of voluminous texts from multiple resources.
E. Long answer type questions
1. Differentiate between the following
a. Document Frequency and Inverse Document Frequency
Ans. Document Frequency is the number of documents in which a word occurs, irrespective of how many
times it occurs in each of those documents. Inverse Document Frequency, on the other hand, is the ratio
obtained by putting the total number of documents in the numerator and the document frequency in the denominator.
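As a small illustration, both quantities can be computed as follows. This is a sketch under assumptions: the function names and the three tokenised sample documents are hypothetical, and the inverse document frequency is taken as the plain ratio described above.

```python
def document_frequency(word, documents):
    # Count the documents that contain the word at least once;
    # repeated occurrences inside one document are not counted again.
    return sum(1 for tokens in documents if word in tokens)

def inverse_document_frequency(word, documents):
    # Total number of documents divided by the document frequency.
    df = document_frequency(word, documents)
    if df == 0:
        raise ValueError(f"'{word}' does not occur in any document")
    return len(documents) / df

# Hypothetical corpus, already normalised and tokenised.
corpus = [
    ["the", "cat", "sat", "the"],
    ["the", "dog", "sat"],
    ["the", "bird", "flew"],
]

for word in ["the", "sat", "bird"]:
    print(word, document_frequency(word, corpus),
          inverse_document_frequency(word, corpus))
```

A word that occurs in every document gets the lowest possible ratio (1), while a word found in only one document gets the highest.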
b. BoW and TFIDF
Ans. Bag of Words (BoW) is a simple and important technique used in Natural Language Processing for
extracting features from textual data.
On the other hand, the Term Frequency and Inverse Document Frequency (TFIDF) method is considered better
than the Bag of Words algorithm because BoW only gives a numeric vector of word counts for each document,
whereas the numeric value produced by TFIDF indicates the importance of each word in the document.
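The difference can be seen in a short sketch. It assumes the commonly used form in which the term frequency of a word is multiplied by the base-10 logarithm of (total documents / document frequency); the function name tfidf_table and the sample documents are hypothetical.

```python
import math
from collections import Counter

def tfidf_table(documents):
    # documents: a list of token lists (already normalised).
    total = len(documents)

    # Document frequency: in how many documents each word occurs.
    df = Counter()
    for tokens in documents:
        df.update(set(tokens))

    # For every document, weight each word's count (its BoW value)
    # by log10(total documents / document frequency).
    table = []
    for tokens in documents:
        tf = Counter(tokens)
        table.append({word: tf[word] * math.log10(total / df[word])
                      for word in tf})
    return table

# Hypothetical corpus.
corpus = [
    ["the", "cat", "sat"],
    ["the", "dog", "sat"],
    ["the", "bird", "flew"],
]

scores = tfidf_table(corpus)
for word, value in scores[0].items():
    print(word, round(value, 3))
```

In the first document every word has the same BoW count of 1, yet TFIDF gives "the" a value of zero because it occurs in every document, and gives "cat" the highest value because it occurs in only one.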
c. Extractive Summarization and Abstractive Summarization
Ans. Extractive Summarization: In this, selected phrases, sentences or sections are picked from
the scattered resources and joined appropriately to form a concise summary.
Abstractive Summarization: In this, the summary is created by interpreting the text from multiple
resources using advanced NLP techniques. The new summary may or may not contain text, phrases or
sentences from the original documents.
d. Human Language and Computer Language
Ans. Human Language is the language used by humans to interact with the people around them, whereas
Computer Language is a language used by programmers to develop computer programs, which help
humans to interact with an electronic device, the computer.
2. What is Data Acquisition in AI Project Cycle?
Ans. Data Acquisition is the second stage of the AI project cycle. It means collecting the raw data
required for reference, analysis and training in the AI project. Data is raw information that is used to
generate meaningful outcomes.
3. Explain any two applications of TFIDF
Ans. Some of the important applications of TFIDF are:
• Document Classification: It helps in classifying documents scattered across the internet based on
their type, genre, etc.
• Topic Modelling: It helps in predicting the topic of a corpus.