
SKD Academy (CBSE)

Session – 2024-2025
Subject – Artificial Intelligence (417)
Important Questions
Chap - NLP
1. _________ is useful when information overload is a real problem and we need to access a specific,
important piece of information from a huge knowledge base.
a) Automatic Summarization
b) Sentiment Analysis
c) Text Classification
d) All of the above
2. The goal of _________ is to identify sentiment among several posts, or even within the
same post where emotion is not always explicitly expressed.
a) Automatic Summarization
b) Sentiment Analysis
c) Text Classification
d) All of the above
3. By dividing up large problems into smaller ones, _________ aims to help you manage them in
a more constructive manner.
a) CDP
b) CBT
c) CSP
d) CLP
4. Cognitive Behavioral Therapy includes _________.
a) Your Thoughts
b) Your Behaviors
c) Your Emotions
d) All of the above
5. Once the textual data has been collected, it needs to be processed and cleaned so that
an easier version can be sent to the machine. This is known as _________.
a) Data Acquisition
b) Data Exploration
c) Data Mining
d) None of the above
6. ________ work around a script which is programmed into them.
a) Script-bot
b) Smart-bot
c) Both a) and b)
d) None of the above
7. ________ work on bigger databases and other resources directly.
a) Script-bot
b) Smart-bot
c) Both a) and b)
d) None of the above
8. __________ helps in cleaning up the textual data in such a way that its complexity
comes down to a level lower than that of the actual data.
a) Speech Normalization
b) Text Normalization
c) Visual Normalization
d) None of the above
9. __________ are the words which occur very frequently in the corpus but do not add any
value to it.
a) Tokens
b) Words
c) Stopwords
d) None of the above
10. Applications of TFIDF are _________.
a) Document Classification
b) Topic Modelling
c) Information Retrieval System and Stop word filtering
d) All of the above
11. _______ is the process in which the affixes of words are removed and the words are
converted to their base form.
a) Stemming
b) Stopwords
c) Case-sensitivity
d) All of the above
12. __________ is a Natural Language Processing model which helps in extracting features
out of the text which can be helpful in machine learning algorithms.
a) Bag of Words
b) Big Words
c) Best Words
d) All of the above
13. Which of the following is not correct about NLP?
a) It is a subfield of AI.
b) It is focused on enabling computers to understand and process human languages.
c) It takes in the data of Natural Languages which humans use in their daily lives.
d) None of the above
14. One of the applications of Natural Language Processing is relevant when used to provide
an overview of a news item or blog post, while avoiding redundancy from multiple
sources and maximizing the diversity of content obtained. Identify the application from
the following:
a) Sentiment Analysis
b) Virtual Assistants
c) Text classification
d) Automatic Summarization
15. The term used for the whole textual data from all the documents altogether is known as
_________.
b) Slab
c) Corpus
d) Cropus

1. What is a Chatbot?
A chatbot is a computer program that's designed to simulate human conversation through
voice commands, text chats or both, e.g. Mitsuku Bot, Jabberwacky, etc.
2. While working with NLP, what is the meaning of:
a. Syntax
b. Semantics
Syntax: Syntax refers to the grammatical structure of a sentence.
Semantics: It refers to the meaning of the sentence.

3. What is the difference between stemming and lemmatization?


Stemming is the process in which the affixes of words are removed and the words are
converted to their base form. It is just like cutting down the branches of a tree to its
stem. For example, the stem of the words eating, eats, eaten is eat.
Lemmatization is the grouping together of different forms of the same word so that they
can be analysed as a single item; unlike a stem, the resulting lemma is always a
meaningful word. In search queries, lemmatization allows end users to query any version
of a base word and get relevant results.
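The difference is easy to see in code. Below is a minimal sketch using NLTK's
PorterStemmer and WordNetLemmatizer (assuming NLTK and its WordNet data are installed);
note that a real stemmer does not always match the textbook example exactly.

import nltk
from nltk.stem import PorterStemmer, WordNetLemmatizer

nltk.download('wordnet', quiet=True)  # the lemmatizer needs the WordNet data

stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()

for word in ["eating", "eats", "eaten"]:
    print(word, "->", stemmer.stem(word), "|", lemmatizer.lemmatize(word, pos="v"))
# Stemming: eating -> eat, eats -> eat, but eaten -> eaten (a stem may not be a real word).
# Lemmatization (treating the words as verbs): all three map to the lemma "eat".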

4. What is meant by a dictionary in NLP?


A dictionary in NLP means a list of all the unique words occurring in the corpus. If some
words are repeated in different documents, they are written just once while creating the
dictionary.
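A minimal sketch of building such a dictionary in Python (the two documents here are
hypothetical):

documents = ["we are going to mumbai", "mumbai is a famous place"]

dictionary = []
for doc in documents:
    for word in doc.split():
        if word not in dictionary:   # a repeated word is written just once
            dictionary.append(word)

print(dictionary)
# ['we', 'are', 'going', 'to', 'mumbai', 'is', 'a', 'famous', 'place']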

5. What is term frequency?


Term frequency is the frequency of a word in one document. It can easily be read off the
document vector table, since that table records the frequency of each word of the
vocabulary in each document.

6. Which package is used for Natural Language Processing in Python programming?


Natural Language Toolkit (NLTK). NLTK is one of the leading platforms for building
Python programs that can work with human language data.
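As a minimal sketch, tokenizing a sentence with NLTK looks like this (assuming the
'punkt' tokenizer data has been downloaded; newer NLTK releases may additionally
require the 'punkt_tab' resource):

import nltk
nltk.download('punkt', quiet=True)

from nltk.tokenize import word_tokenize
print(word_tokenize("We are going to Mumbai."))
# ['We', 'are', 'going', 'to', 'Mumbai', '.']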

7. What is a document vector table?


A Document Vector Table is used while implementing the Bag of Words algorithm.
In a document vector table, the header row contains the vocabulary of the corpus and other
rows correspond to different documents.

8. What do you mean by corpus?


In Text Normalization, we go through several steps to normalize the text to a lower
level. That is, we work on text from multiple documents, and the term used for the
whole textual data from all the documents altogether is known as the corpus.

9. Differentiate between a script-bot and a smart-bot.


Script-bot:
a) A scripted chatbot doesn't carry even a glimpse of AI.
b) Script bots are easy to make.
c) Script bot functioning is very limited as they are less powerful.
d) Script bots work around a script which is programmed into them.
e) Limited functionality.

Smart-bot:
a) Smart bots are built on NLP and ML.
b) Smart bots are comparatively difficult to make.
c) Smart bots are flexible and powerful.
d) Smart bots work on bigger databases and other resources directly.
e) Wide functionality.

10. What is inverse document frequency?


To understand inverse document frequency, first we need to understand document
frequency. Document Frequency is the number of documents in which the word occurs
irrespective of how many times it has occurred in those documents.
In the case of inverse document frequency, we put the document frequency in the
denominator and the total number of documents in the numerator.
For example, if the corpus contains 3 documents and the document frequency of the word
"AMAN" is 2, then its inverse document frequency is 3/2.
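The example above can be checked with a few lines of Python (the three documents are
hypothetical):

documents = ["aman went home", "aman is here", "the weather is nice"]
word = "aman"

N = len(documents)                                        # total number of documents
df = sum(1 for doc in documents if word in doc.split())   # document frequency
print(N / df)  # 1.5, i.e. 3/2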
11. What do you mean by document vectors?
Document Vector contains the frequency of each word of the vocabulary in a particular
document.
In a document vector table, the vocabulary is written in the top row. Then, for each word
in the document, if it matches the vocabulary, put a 1 under it. If the same word appears
again, increment the previous value by 1. If a word does not occur in that document,
put a 0 under it.

12. What is TFIDF?


Term frequency–inverse document frequency (TFIDF) is a numerical statistic that is
intended to reflect how important a word is to a document in a collection or corpus.
Its term frequency part is the number of times a word appears in a document divided by
the total number of words in that document; every document has its own term frequency.
This is then weighted by the inverse document frequency.
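Putting the two parts together, one common formulation is
TFIDF(W) = TF(W) × log(N / DF(W)). A minimal sketch, reusing the corpus from question 18
and assuming a base-10 logarithm:

import math

documents = [
    "we are going to mumbai",
    "mumbai is a famous place",
    "we are going to a famous place",
    "i am famous in mumbai",
]

def tfidf(word, doc, documents):
    tf = doc.split().count(word) / len(doc.split())        # term frequency
    df = sum(1 for d in documents if word in d.split())    # document frequency
    return tf * math.log10(len(documents) / df)            # weight by inverse document frequency

print(tfidf("mumbai", documents[0], documents))  # in 3 of 4 documents -> low idf weight
print(tfidf("we", documents[0], documents))      # in 2 of 4 documents -> higher idf weight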

13. Which words in a corpus have the highest values and which ones have the
least?
Stop words like and, this, is, the, etc. occur the most in a corpus, but these words do
not talk about the corpus at all. Hence, they are termed stopwords and are mostly
removed at the pre-processing stage.
Rare or valuable words occur the least but add the most importance to the corpus. Hence,
when we look at the text, we take both frequent and rare words into consideration.

14. Does the vocabulary of a corpus remain the same before and after text
normalization? Why?

No, the vocabulary of a corpus does not remain the same before and after text
normalization. The reasons are:
a) In normalization the text goes through various steps and is reduced to a minimum
vocabulary, since the machine does not require grammatically correct statements but only
the essence of them.
b) In normalization, stop words, special characters and numbers are removed.
c) In stemming, the affixes of words are removed and the words are converted to their
base form. So, after normalization, we get a reduced vocabulary.

15. Explain the concept of Bag of Words.


Bag of Words is a Natural Language Processing model which helps in extracting features out
of the text which can be helpful in machine learning algorithms. In bag of words, we get the
occurrences of each word and construct the vocabulary for the corpus.

16. Explain the relation between occurrence and value of a word.


Plotted as a graph, occurrence and value of a word are inversely proportional. The words
which occur most often (like stop words) have negligible value. As the occurrence of
words drops, the value of such words rises. These words are termed rare or valuable
words: they occur the least but add the most value to the corpus.

17. What are the applications of TFIDF?


TFIDF is commonly used in the Natural Language Processing domain. Some of its applications
are:
a) Document Classification - Helps in classifying the type and genre of a document.
b) Topic Modelling - It helps in predicting the topic for a corpus.
c) Information Retrieval System - To extract the important information out of a corpus.
d) Stop word filtering - Helps in removing the unnecessary words out of a text body.

18. Create a document vector table for the given corpus:


Document 1: We are going to Mumbai
Document 2: Mumbai is a famous place.
Document 3: We are going to a famous place.
Document 4: I am famous in Mumbai.

        We  are  going  to  Mumbai  is  a  famous  place  I  am  in
Doc 1:  1   1    1      1   1       0   0  0       0      0  0   0
Doc 2:  0   0    0      0   1       1   1  1       1      0  0   0
Doc 3:  1   1    1      1   0       0   1  1       1      0  0   0
Doc 4:  0   0    0      0   1       0   0  1       0      1  1   1
19. Write the steps necessary to implement the bag of words algorithm.

Answer – The steps to implement the bag of words algorithm are as follows:
1. Text Normalisation: Collect data and pre-process it.
2. Create Dictionary: Make a list of all the unique words occurring in the corpus.
3. Create document vectors: For each document in the corpus, find out how many times
each word from the unique list of words has occurred.
4. Create document vectors for all the documents.
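A minimal sketch of these four steps in Python, using the corpus from question 18;
the normalisation here is simply lower-casing and removing full stops:

documents = [
    "We are going to Mumbai",
    "Mumbai is a famous place.",
    "We are going to a famous place.",
    "I am famous in Mumbai.",
]

# Step 1: Text Normalisation (lower-case, strip punctuation, split into words)
normalised = [doc.lower().replace(".", "").split() for doc in documents]

# Step 2: Create Dictionary of unique words
dictionary = []
for tokens in normalised:
    for word in tokens:
        if word not in dictionary:
            dictionary.append(word)

# Steps 3 and 4: Create a document vector for every document
for tokens in normalised:
    print([tokens.count(word) for word in dictionary])
# Each printed row is one document vector; the columns follow the dictionary order:
# ['we', 'are', 'going', 'to', 'mumbai', 'is', 'a', 'famous', 'place', 'i', 'am', 'in']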

20. Imagine developing a prediction model based on AI and deploying it to monitor
traffic congestion on the roadways. The model's goal is to foretell whether or not
there will be a traffic jam. We must now determine whether or not the predictions this
model generates are accurate in order to gauge its efficacy. Prediction and Reality are
the two conditions that we need to consider.
Today, traffic jams are a regular occurrence in our lives. Every time you get on the
road in an urban location, you have to deal with traffic. Most pupils choose to take
buses to school. Due to these traffic bottlenecks, the bus frequently runs late, making
it impossible for the pupils to get to school on time.
Create a Confusion Matrix for the aforementioned scenario, taking into account all
potential outcomes.
Answer –
Case 1: Is there a traffic jam? Prediction: Yes | Reality: Yes → True Positive
Case 2: Is there a traffic jam? Prediction: No | Reality: No → True Negative
Case 3: Is there a traffic jam? Prediction: Yes | Reality: No → False Positive
Case 4: Is there a traffic jam? Prediction: No | Reality: Yes → False Negative

As a confusion matrix:

                 Reality: Yes     Reality: No
Prediction: Yes  True Positive    False Positive
Prediction: No   False Negative   True Negative
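Given lists of predictions and actual outcomes, the four counts can be tallied with a
short sketch (the example data below is hypothetical):

predictions = ["Yes", "No", "Yes", "No"]   # what the model foretold
reality     = ["Yes", "No", "No",  "Yes"]  # what actually happened

tp = tn = fp = fn = 0
for p, r in zip(predictions, reality):
    if   p == "Yes" and r == "Yes": tp += 1  # True Positive
    elif p == "No"  and r == "No":  tn += 1  # True Negative
    elif p == "Yes" and r == "No":  fp += 1  # False Positive
    else:                           fn += 1  # False Negative

print("TP:", tp, "TN:", tn, "FP:", fp, "FN:", fn)  # TP: 1 TN: 1 FP: 1 FN: 1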

21. Make a 4W Project Canvas for the following scenario:

Risks will become more concentrated in a single network as more and more innovative
technologies are used. In such cases, cybersecurity becomes incredibly complex and is
no longer under the authority of firewalls, which cannot recognise odd behaviour
patterns such as data migration. Consider how AI systems can sift through voluminous
data to find vulnerable user behaviour. To explicitly define the scope, the method of
data collection, the model, and the evaluation criteria, use an AI project cycle.

A possible 4W canvas for this scenario:
Who: Organisations whose networks rely on many innovative technologies, and the
security teams that protect them.
What: Cyber risks concentrated in a single network that firewalls cannot detect, such
as odd behaviour patterns like data migration.
Where: In network security monitoring, across the voluminous user data flowing through
the network.
Why: AI systems can sift through this data to find vulnerable user behaviour that
rule-based firewalls miss.
