Week3 Assignment
Week3 Assignment
Introduction to
Machine Learning
Assignment- Week 3
TYPE OF QUESTION: MCQ
Number of questions: 8 Total mark: 8 X 2 = 16
QUESTION 1:
Suppose, you have given the following data where x and y are the 2 input variables and Class is
the dependent variable.
X Y Class
-1 1 -
0 1 +
0 2 -
1 -1 -
1 0 +
1 2 +
2 2 -
2 3 +
Suppose, you want to predict the class of new data point x=1 and y=1 using euclidean distance in
7-NN. To which class the data point belongs to?
A. + Class
B. – Class
C. Can’t say
D. None of these
______________________________________________________________________________
QUESTION 2:
Imagine you are dealing with 15 class classification problem. What is the maximum number of
discriminant vectors that can be produced by LDA?
A. 20
B. 14
C. 21
D. 10
Correct Answer: B. 14
Detailed Solution : LDA produces at most c − 1 discriminant vectors, c = no of classes
______________________________________________________________________________
QUESTION 3:
‘People who bought this, also bought…’ recommendations seen on amazon is a result of which
algorithm?
Detailed Solution : Though both User based and Item based CF methods are used in
recommendation systems, Amazon specifically uses Item based filtering.
______________________________________________________________________________
NPTEL Online Certification Courses Indian
Institute of Technology Kharagpur
QUESTION 4:
A. Only 2
B. 1, 3 and 4
C. 1, 2 and 3
D. 2, 3 and 4
Correct Answer: D
Detailed Solution : PCA is an unsupervised learning algorithm, so 1 is wrong. Other options are
true about PCA.
______________________________________________________________________________
QUESTION 5:
Consider the figures below. Which figure shows the most probable PCA component directions for
the data points?
A. A
B. B
C. C
D. D
Correct Answer: A. A
Detailed Solution : PCA tries to choose the direction in such a way that maximizes the variance in the
data.
______________________________________________________________________________
QUESTION 6:
When there is noise in data, which of the following options would improve the performance of the KNN
algorithm?
Detailed Solution : Increasing the value of k reduces the effect of the noise and improves the
performance of the algorithm.
______________________________________________________________________________
QUESTION 7:
A. KNN algorithm does more computation on test time rather than train time.
B. KNN algorithm does lesser computation on test time rather than train time.
C. KNN algorithm does an equal amount of computation on test time and train time.
D. None of these.
Correct Answer: A. KNN algorithm does more computation on test time rather than train
time.
Detailed Solution : The training phase of the algorithm consists only of storing the feature
vectors and class labels of the training samples.
In the testing phase, a test point is classified by assigning the label which are most frequent
among the k training samples nearest to that query point – hence higher computation.
______________________________________________________________________________
QUESTION 8:
Find the value of the Pearson’s correlation coefficient of X and Y from the data in the following
table.
43 99
21 65
25 79
42 75
A. 0.47
B. 0.68
C. 1
D. 0.33
Correct Answer : B. 0.68
∑(𝑋𝑖−𝑋)((𝑌𝑖−𝑌)
Detailed Solution : Pearson Coefficient 𝑟 = 𝑖
2 2
∑(𝑋𝑖−𝑋) ∑(𝑌𝑖−𝑌)
𝑖 𝑖
______________________________________________________________________________
******END*****