Total No. of Questions : 4] SEAT No.
PA-10288 [Total No. of Pages : 1
[6009]-322
T.E. (Computer Engineering) (Insem.)
DATA SCIENCE AND BIG DATA ANALYTICS
(2019 Pattern) (Semester - II) (310251)
Time : 1 Hour] [Max. Marks : 30
Instructions to the candidates:
1) Answer questions Q.1 or Q.2, Q.3 or Q.4.
2) Neat diagrams must be drawn wherever necessary.
3) Figures to the right side indicate full marks.
4) Assume suitable data if necessary.
5) Use of a scientific calculator is allowed.
Q1) a) What is dimensionality reduction, and what are its benefits? [4]
b) What is data wrangling? Why do you need it? [5]
c) What is regression? Explain the different types of regression with examples. [6]
OR
Q2) a) Differentiate between Data Science, Machine Learning and AI. [4]
b) What does feature engineering typically include? [5]
c) What is data discretization? Explain the forms of data discretization. [6]
Q3) a) Write a short note on the contingency table and explain it with an example. [4]
b) With an example, explain Bayes' theorem. Also explain its key terms. [5]
c) Is there a correlation between the variables in the following data set? [6]
Hours   9  15  25  14  10  18  19  16  20  18
Marks  39  56  93  61  50  75  42  70  66  32
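One possible way to approach Q3 c) is to compute Pearson's correlation coefficient for the two rows. The question does not name a method, so the choice of Pearson's r and the helper name pearson_r are assumptions made for illustration only. A minimal sketch:

```python
import math

# Data copied from the table in Q3 c).
hours = [9, 15, 25, 14, 10, 18, 19, 16, 20, 18]
marks = [39, 56, 93, 61, 50, 75, 42, 70, 66, 32]


def pearson_r(x, y):
    """Pearson's r: covariance of x and y divided by the product of their spreads.

    pearson_r is a hypothetical helper name, not part of any library.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of products and squares of the deviations from the means.
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return s_xy / math.sqrt(s_xx * s_yy)


r = pearson_r(hours, marks)
print(f"Pearson r = {r:.3f}")
```

A value of r close to +1 or -1 suggests a strong linear relationship, while a value close to 0 suggests a weak one; the sign gives the direction.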
OR
Q4) a) What is a population, and how does it differ from a sample? [4]
b) With an example, explain one-tailed & two-tailed t-tests. [5]
c) Describe the Chi-Square Test of Independence. [6]