[4th NLP'22] Final Exam
[4th NLP'22] Final Exam
Zaki Taha
Subject: CSC340 Natural Language Processing Dr. Sally Saad
Exam: (Final) 18/6/2022 Offering Dept.: Computer Science
Year: 4th year undergraduate (CS) Academic year: 2021-2022
Duration: 2 hours
Question 3
Answer the following Questions regarding the Search Engines. marks:15
Search Engines are one of the most common applications in NLP.
1) State briefly three challenges that may encounter the work of a search engine.
2) What are the three main characteristics that should be found in a search
engine?
3) State briefly how do the following searching techniques work?
a. Bi-Word Indexes.
b. Positional Indexes.
c. Ranked Retrieval Search.
Question 4
Implement the following NLP tasks on the given Case study.
marks: 15
ذهب محمد ونادر إىل المدرسة.1
. ثم قابال أصدقائهما ولعبوا فرحي ف المدرسة.2
Page 1 of 2
3) Tokenization and Lemmatization. (2 marks)
4) Compute the Tf-Idf for each document/word, showing the following tables: (9 marks)
a. Bag of Words and Term Frequency Tables.
b. Inverse Document Frequency Table.
c. TF-IDF Table.
Given the following POS Tags to be used
Abbreviation Meaning
S sentence; subject
PP prepositional phrase
IN Preposition or subordinating
conjunction
VP Verbal phrase
NP noun phrase
DTNN Determiner
JJ Adjective
CC Coordinating conjunction
Page 2 of 2