Eti 3111
Eti 3111
1. Lemmatization
2. Levenshtein
3. Stemming
4. Soundex
A) 1 and 2
B) 2 and 4
C) 1 and 3
D) 1, 2 and 3
E) 2, 3 and 4
F) 1, 2, 3 and 4
2) N-grams are defined as the combination of N keywords together. How many bi-
grams can be generated from given sentence:
A) 7
B) 8
C) 9
D) 10
E) 11
3) How many trigrams phrases can be generated from the following sentence,
after performing following text cleaning steps:
• Stopword Removal
• Replacing punctuations by a single space
A) 3
B) 4
C) 5
D) 6
E) 7
4) Which of the following regular expression can be used to identify date(s)
present in the text object:
A) \d{4}-\d{2}-\d{2}
B) (19|20)\d{2}-(0[1-9]|1[0-2])-[0-2][1-9] C) (19|20)\d{2}-(0[1-9]|1[0-2])-([0-2][1-9]|3[0-1])
D) None of the above
5) Which of the following models can perform tweet classification with regards to
context mentioned above?
A) Naive Bayes
B) SVM
C) None of the above
6) You have created a document term matrix of the data, treating every tweet as
one document. Which of the following is correct, in regards to document term
matrix?
1. Removal of stopwords from the data will affect the dimensionality of data
2. Normalization of words in the data will reduce the dimensionality of data
3. Converting all the words in lowercase will not affect the dimensionality of the data
A) Only 1
B) Only 2
C) Only 3
D) 1 and 2
E) 2 and 3
F) 1, 2 and 3
8) What percentage of the total statements are correct with regards to Topic
Modeling?
A) 0
B) 25
C) 50
D) 75
E) 100
9) In Latent Dirichlet Allocation model for text classification purposes, what does
alpha and beta hyper parameter represent-?
A) Alpha: number of topics within documents, beta: number of terms within topics
False
B) Alpha: density of terms generated within topics, beta: density of topics
generated within terms False
C) Alpha: number of topics within documents, beta: number of terms within topics
False
D) Alpha: density of topics generated within documents, beta: density of
terms generated within topics True
10) Solve the equation according to the sentence “I am planning to visit New Delhi
to attend Analytics Vidhya Delhi Hackathon”.
A) 5, 5, 2
B) 5, 5, 0
C) 7, 5, 1
D) 7, 4, 2
E) 6, 4, 3
5. 12) Which of the following documents contains the same number of terms
and the number of terms in the one of the document is not equal to least
number of terms in any document in the entire corpus.
A) d1 and d4
B) d6 and d7
C) d2 and d4
D) d5 and d6
6. 14) What is the term frequency of a term which is used a maximum number
of times in that document?
A) t6 – 2/5
B) t3 – 3/6
C) t4 – 2/6
D) t1 – 2/6
7. 15) Which of the following technique is not a part of flexible text matching?
A) Soundex
B) Metaphone
C) Edit Distance
D) Keyword Hashing
A) TRUE
B) FALSE
9. 17) Which of the following statement is(are) true for Word2Vec model?
1. Text cleaning
2. Text annotation
3. Gradient descent
4. Model tuning
5. Text to predictors
A) 12345
B) 13425
C) 12534
D) 13452
21) Which of the following models can be used for the purpose of document
similarity?
A) Training a word 2 vector model on the corpus that learns context present in the
document
B) Training a bag of words model that learns occurrence of words in the document
C) Creating a document-term matrix and using cosine similarity for each document
D) All of the above
A) 1
B) 12
C) 123
D) 1234
E) 12345
F) 123456
23) While creating a machine learning model on text data, you created a document
term matrix of the input data of 100K documents. Which of the following remedies
can be used to reduce the dimensions of data –
A) only 1
B) 2, 3
C) 1, 3
D) 1, 2, 3
A) 1
B) 2
C) 1, 2
D) 1, 2, 3
25) While working with text data obtained from news sentences, which are
structured in nature, which of the grammar-based text parsing techniques can be
used for noun phrase detection, verb phrase detection, subject detection and
object detection.
26) Social Media platforms are the most intuitive form of text data. You are given a
corpus of complete social media data of tweets. How can you create a model that
suggests the hashtags?
27) While working with context extraction from a text data, you encountered two
different sentences: The tank is full of soldiers. The tank is full of nitrogen. Which
of the following measures can be used to remove the problem of word sense
disambiguation in the sentences?
29) Retrieval based models and Generative models are the two popular techniques
used for building chatbots. Which of the following is an example of retrieval model
and generative model respectively.
30) What is the major difference between CRF (Conditional Random Field) and
HMM (Hidden Markov Model)?
.2 Modern NLP algorithms are based on machine learning, especially statistical machine
learning.
a) True
b) False
Ans:- a) True
Ans:- a) True
Ans - a) True
Ans- a) True
6Given a sound clip of a person or people speaking, determine the textual representation of the
speech.
a) Text-to-speech
b) Speech-to-text
c) All of the mentioned
d) None of the mentioned
Ans:- b) Speech-to-text
• A. Voluminous
• B. Difficult to characterize
• C. Variability
2. Morphological Segmentation
• B. Separate words into individual morphemes and identify the class of the
morphemes
• D. None
• D. All of these
• A. Facts
• B. Symbols
• C. Both A and B
• D. None
• A. Representation Adequacy
• B. Inferential Adequacy
• C. Inferential Efficiency
• D. All of these
6. Natural Language Processing (NLP) is field of
• A. Computer Science
• B. Artificial Intelligence
• C. Linguistics
• A. Lexical ambiguity
• C. Both A and B
• D. None
• A. Computer Science
• B. Cognitive Science
• C. Engineering
• A. Computer Science
• B. Biology
• C. Psychology
• C. Algorithm specialists
View Answer
Artificial intelligence is Making a Machine intelligent
• Education
• Defense
• Energy
• Justice
View Answer
DARPA, the agency that has funded a great deal of American Artificial Intelligence
research, is part of the Department of Defense.
• John McCarthy
• Fisher Ada
• Allen Newell
• Alan Turning
View Answer
the “father” of artificial intelligence is Fisher Ada .
• IntelliCorpn
• Teknowledge
• Texas Instruments
• Tech knowledge
View Answer
KEE is a product of IntelliCorpn .
• Analogical reasoning
• Bitonic reasoning
• Non-monotonic reasoning
• Monotonic reasoning
View Answer
Default reasoning is another type of Non-monotonic reasoning.
7. Weak AI is
• a set of computer programs that produce output that would be considered to reflect
intelligence if it were generated by humans.
• the study of mental faculties through the use of mental models implemented on a computer.
• the embodiment of human intellectual capabilities within a computer.
• All of the above
View Answer
Weak AI is the study of mental faculties through the use of mental models implemented
on a computer.
• mobile
• open loop
• intelligent
• non-servo
View Answer
If a robot can alter its own trajectory in response to external conditions, it is considered to
be intelligent .
1) What is the field of Natural Language Processing (NLP)?
a) Computer Science
b) Artificial Intelligence
c) Linguistics
d) All of the mentioned
Answer: d
2) NLP is concerned with the interactions between computers and human
(natural) languages.
a) True
b) False
Answer: a
3) What is the main challenge/s of NLP?
a) Handling Ambiguity of Sentences
b) Handling Tokenization
c) Handling POS-Tagging
d) All of the mentioned
Answer: a
4) NLP stands for Natural Language Processing.
a) True
b) False
Answer: true
5) Choose form the following areas where NLP can be useful.
a) Automatic Text Summarization
b) Automatic Question-Answering Systems
c) Information Retrieval
d) All of the above
Answer: d
6) Natural language processing is divided into the two sub fields of:
a) Symbolic and numeric
b) Time and motion
c) Algorithmic and heuristic
d) Understanding and generation
Answer: d
7) A natural language generation program must decide:
a) what to say
b) when to say something
c) why it is being used
d) both a and b
Answer: d
8) People overcome natural language problems by:
a) grouping attributes into frames
b) understanding ideas in context
c) identifying with familiar situations
d) both (b) and (c)
Answer: d
MCQS on Natural Language Processing
1. What is the field of Natural Language Processing (NLP)?
a) Computer Science
b) Artificial Intelligence
c) Linguistics
d) All of the mentioned
Answer: d
2. NLP is concerned with the interactions between computers and human (natural)
languages.
a) True
b) False
Answer: a
Answer: d
7. What is Coreference Resolution?
a) Anaphora Resolution
b) Given a sentence or larger chunk of text, determine which words (“mentions”) refer to the
same objects (“entities”)
c) All of the mentioned
d) None of the mentioned
Answer: b
9. The more general task of coreference resolution also includes identifying so-called
“bridging relationships” involving referring expressions.
a) True
b) False
Answer: a
11. Given a stream of text, Named Entity Recognition determines which pronoun maps
to which noun.
a) False
b) True
Answer: a
12. Natural Language generation is the main task of Natural language processing.
a) True
b) False
Answer: a
15. Parsing determines Parse Trees (Grammatical Analysis) for a given sentence.
a) True
b) False
Answer: a
16. IR (information Retrieval) and IE (Information Extraction) are the two same thing.
a) True
b) False
Answer: b
17. Many words have more than one meaning; we have to select the meaning which
makes the most sense in context. This can be resolved by ____________
a) Fuzzy Logic
b) Word Sense Disambiguation
c) Shallow Semantic Analysis
d) All of the mentioned
Answer: b
18. Given a sound clip of a person or people speaking, determine the textual
representation of the speech.
a) Text-to-speech
b) Speech-to-text
c) All of the mentioned
d) none of the mentioned
Answer: b
21. Collaborative Filtering and Content Based Models are the two popular
recommendation engines, what role does NLP play in building such algorithms.
Answer: d
22. With respect to this context-free dependency graphs, how many sub-trees exists in
the sentence?
a) 3
b) 4
c) 5
d) 6
Answer: d
MCQ on Natural Language Processing
1. What is the field of Natural Language Processing (NLP)?
a) Computer Science
b) Artificial Intelligence
c) Linguistics
d) All of the mentioned
Answer: d
2. NLP is concerned with the interactions between computers and human (natural) languages.
a) True
b) False
Answer: a
4. Modern NLP algorithms are based on machine learning, especially statistical machine
learning.
a) True
b) False
Answer: a.
9. DEC advertises that it helped to create “the world’s first expert system routinely used in an
industrial environment,” called XCON or __________
a) PDP-11
b) Rl
c) VAX
d) MAGNOM
Answer: b
10. Prior to the invention of time-sharing, the prevalent method of computer access was
____________
a) batch processing
b) telecommunication
c) remote access
d) all of the mentioned
Answer: a
11. Seymour Papert of the MIT AI lab created a programming environment for children
called ___________
a) BASIC
b) LOGO
c) MYCIN
d) FORTRAN
Answer: b
12. Which of the following is a project of the Strategic Computing Program?
a) Defense Advanced Research Projects Agency
b) National Science Foundation
c) Jet Propulsion Laboratory
d) All of the mentioned
Answer: a
13. The original LISP machines produced by both LMI and Symbolics were based on
research performed at?
a) CMU
b) MIT
c) Stanford University
d) RAMD
Answer: b
16. In LISP, the function returns t if is a CONS cell and nil otherwise ________
a) (cons )
b) (consp )
c) (eq )
d) (cous =)
Answer:b
17. In a rule-based system, what is the form of procedural domain knowledge?
a) production rules
b) rule interpreters
c) meta-rules
d) control rules
Answer: a
18. If a robot can alter its own trajectory in response to external conditions, it is considered to
be ____________
a) intelligent
b) mobile
c) open loop
d) non-servo
Answer: a
20. One of the leading American robotics centres is the Robotics Institute located at?
a) CMU
b) MIT
c) RAND
d) SRI
Answer: a
1. What is the field of Natural Language Processing (NLP)?
a) Computer Science
b) Artificial Intelligence
c) Linguistics
d) All of the mentioned
2. NLP is concerned with the interactions between computers and human (natural) languages.
a) True
b) False
3. What is the main challenge/s of NLP?
a) Handling Ambiguity of Sentences
b) Handling Tokenization
c) Handling POS-Tagging
d) All of the mentioned
4. Modern NLP algorithms are based on machine learning, especially statistical machine
learning.
a) True
b) False
5. Which of the following includes major tasks of NLP?
a) Automatic Summarization
b) Discourse Analysis
c) Machine Translation
d) All of the mentioned
6. What is Machine Translation?
a) Converts one human language to another
b) Converts human language to machine language
c) Converts any human language to English
d) Converts Machine language to human language
7. What is Morphological Segmentation?
a) Does Discourse Analysis
b) Separate words into individual morphemes and identify the class of the morphemes
c) Is an extension of propositional logic
d) None of the mentioned
8. Natural Language generation is the main task of Natural language processing.
a) True
b) False
9. OCR (Optical Character Recognition) uses NLP.
a) True
b) False
10. Parts-of-Speech tagging determines ___________
a) part-of-speech for each word dynamically as per meaning of the sentence
b) part-of-speech for each word dynamically as per sentence structure
c) all part-of-speech for a specific word given as input
d) all of the mentioned
11. Parsing determines Parse Trees (Grammatical Analysis) for a given sentence.
a) True
b) False
MCQs on Natural Language Processing
1. What is the field of Natural Language Processing (NLP)?
i. Computer Science
ii. Artificial Intelligence
iii. Linguistics
iv. All of the mentioned
2. NLP is concerned with the interactions between computers and human (natural)
languages.
i. True
ii. False
9. The more general task of coreference resolution also includes identifying so-called
“bridging relationships” involving referring expressions.
i. True
ii. False
i. MQ Telemetry Things
ii. MQ Transport Telemetry
iii. MQ Transport Things
iv. MQ Telemetry Transport
i. True
ii. False
i. MQTT
ii. HTTP
iii. CoAP
iv. SPI
4. PubNub publishes and subscribes _________ in order to send and receive messages.
i. Network
ii. Account
iii. Portal
iv. Keys
i. Bluetooth
ii. UART
iii. WiFi
iv. HTTP