SAIDS Mu Ques Paper Merged
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science
1T01875 - T.E. Computer Science & Engineering (Artificial Intelligence & Machine Learning) (Choice Based)
(R-2019 'C' Scheme) SEMESTER - V / 48895 - Department Optional Course - 1: Statistics for Artificial
Intelligence & Data Science QP CODE: 10039067 DATE: 04/12/2023
Duration: 3hrs [Max Marks:80]
(1) Question No 1 is Compulsory.
(2) Attempt any three questions out of the remaining five.
(3) All questions carry equal marks.
(4) Assume suitable data, if required and state it clearly.
1 Attempt any four [20]
a) Write a short note on hypothesis testing.
b) What is Fisher's exact test?
d) Write a short note on Random sampling
e) What is the empirical CDF function?
2 a) Construct a frequency distribution table for the following weights (in gm) of 30 [10]
oranges using equal class intervals, one of them being 40-45 (45 not included).
The weights are: 31, 41, 46, 33, 44, 51, 56, 63, 71, 71, 62, 63, 54, 53, 51, 43,
36, 38, 54, 56, 66, 71, 74, 75, 46, 47, 59, 60, 61, 63.
(a) What is the class mark of the class interval 50-55?
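One way to check a hand-built table for this question is the short sketch below. It assumes classes of width 5 that include the lower limit and exclude the upper limit, as the question states for 40-45, and it assumes a range of 30 to 80 chosen only to cover the smallest and largest weights. The names `frequency_table` and `class_mark` are illustrative.

```python
# Weights (in gm) of the 30 oranges, copied from the question.
weights = [31, 41, 46, 33, 44, 51, 56, 63, 71, 71, 62, 63, 54, 53, 51, 43,
           36, 38, 54, 56, 66, 71, 74, 75, 46, 47, 59, 60, 61, 63]

WIDTH = 5              # class width implied by the given class 40-45
START, STOP = 30, 80   # assumed range covering min (31) and max (75)


def frequency_table(data, start, stop, width):
    """Return [(lower, upper, count)] for classes [lower, upper)."""
    table = []
    for lower in range(start, stop, width):
        upper = lower + width
        count = sum(1 for w in data if lower <= w < upper)
        table.append((lower, upper, count))
    return table


def class_mark(lower, upper):
    """Class mark is the midpoint of the class interval."""
    return (lower + upper) / 2


table = frequency_table(weights, START, STOP, WIDTH)
for lower, upper, count in table:
    print(f"{lower}-{upper}: {count}")
print("class mark of 50-55:", class_mark(50, 55))
```

The frequencies should add up to 30, which is a quick sanity check on any table built by hand.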
b) What is the primary purpose of conducting a one-way ANOVA. Explain the [10]
3 a) Find the standard error of the estimate for the average number of children in a [10]
household in your city by using the data collected from a sample of households
in your city. Then find a 95% confidence interval for the data.
1 2
2 3
3 1
4 0
5 5
6 2
7 1
8 4
regression?
4 a) A radar unit is used to measure speeds of cars on a motorway. The speeds are [10]
normally distributed with a mean of 90 km/hr and a standard deviation of 10
km/hr. What is the probability that a car picked at random is travelling at more
than 100 km/hr?
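The required tail probability can be checked numerically. The sketch below uses Python's standard-library `statistics.NormalDist`; the variable names are illustrative.

```python
from statistics import NormalDist

# Speeds are modelled as Normal(mean=90, sd=10), as given in the question.
speeds = NormalDist(mu=90, sigma=10)

# Standardise: z = (x - mean) / sd
z = (100 - 90) / 10

# P(X > 100) = 1 - P(X <= 100)
p_over_100 = 1 - speeds.cdf(100)

print(f"z = {z:.2f}, P(X > 100) = {p_over_100:.4f}")
```

With z = 1 this is the familiar upper-tail area of roughly 0.16 from a standard normal table.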
b) Explain Numerical and Categorical data types with appropriate examples [10]
5 a) Duracell manufactures batteries that the CEO claims will last an average of 300 [10]
hours under normal use. A researcher randomly selected 20 batteries from the
production line and tested these batteries. The tested batteries had a mean life
span of 270 hours with a standard deviation of 50 hours. Do we have enough
evidence to suggest that the claim of an average lifetime of 300 hours is false?
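A one-sample t-test fits this question because the population standard deviation is unknown and n = 20 is small. The sketch below is a minimal check, assuming a two-tailed test at a 5% significance level (the question does not state a level) and taking the critical value 2.093 for 19 degrees of freedom from a standard t table.

```python
import math

mu0 = 300      # claimed mean lifetime (hours), H0: mu = 300
x_bar = 270    # sample mean (hours)
s = 50         # sample standard deviation (hours)
n = 20         # sample size

# Standard error of the mean and t statistic
se = s / math.sqrt(n)
t_stat = (x_bar - mu0) / se
df = n - 1

# Assumption: two-tailed test, alpha = 0.05; value read from a t table.
T_CRIT_DF19 = 2.093
reject_h0 = abs(t_stat) > T_CRIT_DF19

print(f"t = {t_stat:.4f}, df = {df}, reject H0: {reject_h0}")
```

Under these assumptions |t| exceeds the critical value, so the sample gives evidence against the 300-hour claim.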
b) Explain linear least squares regression (LLSR) along with its advantages and [10]
disadvantages.
6 a) A farmer is trying out a planting technique that he hopes will increase the yield [10]
on his pea plants. The average number of pods on one of his pea plants is 145
pods with a standard deviation of 100 pods. This year, after trying his new
planting technique, he takes a random sample of his plants and finds the average
number of pods to be 147. He wonders whether this is a statistically
significant increase. What are his hypotheses and the test statistic?
b) What is the Chi-Square Test in statistics, and in what kind of situations or [10]
*********************
Paper / Subject Code: 48895 / Department Optional Course - I: Statistics for
Artificial Intelligence & Data Science
1T01865 - T.E. Computer Science & Engineering (Data Science) (Choice Based)
(R-2019-20 'C' Scheme) SEMESTER - V / 48895 - Department Optional Course -
I: Statistics for Artificial Intelligence & Data Science
QP CODE: 10014523
Date: 02/12/2022
[Marks: 80]
[Time: 3 Hours]
a) Find the standard deviation of the average temperatures recorded over a five-
day period last winter: 19, 21, 18, 24, 12?
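A quick numerical check of this question is sketched below. The question does not say whether the five readings are a full population or a sample, so both versions are shown; which one applies is an assumption to state in the answer.

```python
from statistics import mean, pstdev, stdev

# Average temperatures over the five-day period, from the question.
temps = [19, 21, 18, 24, 12]

avg = mean(temps)

# Population standard deviation: divide the sum of squares by n
sd_population = pstdev(temps)

# Sample standard deviation: divide the sum of squares by n - 1
sd_sample = stdev(temps)

print(f"mean = {avg}")
print(f"population sd = {sd_population:.4f}")
print(f"sample sd = {sd_sample:.4f}")
```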
d) The school principal wants to test if it is true what teachers say - that high
school juniors use the computer an average 3.2 hours a day. What are null and
alternative hypotheses?
a) Find the value of the correlation coefficient from the data given in the
following table:
1 43 99
2 21 65
3 25 79
4 42 75
5 57 87
6 59 81
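The table's column headers are not shown here, so the sketch below assumes the first column is a row index and the second and third columns are the paired variables x and y. Under that assumption, Pearson's correlation coefficient can be computed directly from its definition.

```python
import math

# Assumed reading of the table: (index, x, y) per row.
x = [43, 21, 25, 42, 57, 59]
y = [99, 65, 79, 75, 87, 81]


def pearson_r(xs, ys):
    """Pearson correlation: Sxy / sqrt(Sxx * Syy)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    sxx = sum((a - mean_x) ** 2 for a in xs)
    syy = sum((b - mean_y) ** 2 for b in ys)
    return sxy / math.sqrt(sxx * syy)


r = pearson_r(x, y)
print(f"r = {r:.4f}")
```

A value near 0.53 indicates a moderate positive linear relationship between the two columns.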
1 25 20 18
2 30 28 36
3 28 30 34
4 38 35 22
5 31 35 28
88 77 71 69
82 76 56 65
86 84 64 68
87 59 51 81
a) If the sample mean and expected mean value of the marks obtained by 15
students in a class test are 290 and 300 respectively, what is the t-score if the
standard deviation of the marks is 50?
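The t-score follows from t = (sample mean - expected mean) / (s / sqrt(n)). A minimal sketch with illustrative variable names:

```python
import math

x_bar = 290   # sample mean
mu = 300      # expected (hypothesised) mean
s = 50        # standard deviation of the marks
n = 15        # number of students

# t = (x_bar - mu) / (s / sqrt(n))
t_score = (x_bar - mu) / (s / math.sqrt(n))

print(f"t = {t_score:.4f} with {n - 1} degrees of freedom")
```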
b) Find out what is the relation between the GPA of a class of students and the
number of hours of study and the height of the student:
GPA Height Study Hours
2.9 66 7
3.0 65 8
3.62 67 7
3.2 62.5 7
3.1 64 8
2.8 63 6
3.63 68 9
3.84 65 6
3.93 69 10
3.76 64 7
2.75 59 4
a) A farmer is trying out a planting technique that he hopes will increase the
yield on his pea plants. The average number of pods on one of his pea plants is
145 pods with a standard deviation of 100 pods. This year, after trying his new
planting technique, he takes a random sample of his plants and finds the
average number of pods to be 147. He wonders whether this is a statistically
significant increase. What are his hypotheses and the test statistic? Use a 0.05
significance level.
b) Find the simple linear regression equation that fits the given data and
coefficient of determination:
Hour Temp
1 21
2 27
4 25
8 86
10 92
12 96
Paper 1
a) Construct a frequency distribution table for the following weights (in gm) of
30 oranges using equal class intervals, one of them is 40-45.
Weights: 31, 41, 46, 33, 44, 51, 56, 61, 71, 63, 52, 64, 53, 51, 43, 36, 38, 54,
56, 66, 71, 74, 45, 47, 60, 59, 63, 61, 63, 60
1 2
2 1
3 2
4 2
5 1
6 2
7 4
8 2
9 1
10 2
a) Duracell manufactures batteries that the CEO claims will last an average of
300 hours under normal use. A researcher randomly selected 20 batteries from
the production line and tested these batteries. The tested batteries had a mean
life span of 270 hours with a standard deviation of 50 hours. Do we have enough
evidence to suggest that the claim of an average lifetime of 300 hours is false?
b) Explain Linear Least Square Regression (LLSR) along with its advantages and
disadvantages.
Q.6 (10 Marks)
a) A farmer is trying out a planting technique that he hopes will increase the
yield on his pea plants. The average number of pods on one of his pea plants is
145 pods with a standard deviation of 100 pods. This year, after trying his new
planting technique, he takes a random sample of his plants and finds the
average number of pods to be 147. He wonders whether this is a statistically
significant increase. What are his hypotheses and the test statistic?
Paper 2
a) For a certain type of computers, the length of time between charges of the
battery is normally distributed with a mean of 50 hours and a standard deviation
of 15 hours. John owns one of these computers and he wants to know the
probability that the length of time will be between 50 and 70 hours.
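The probability of an interval under a normal model is the difference of two CDF values. The sketch below uses the standard-library `statistics.NormalDist`; variable names are illustrative.

```python
from statistics import NormalDist

# Time between charges ~ Normal(mean=50, sd=15), as given in the question.
charge_time = NormalDist(mu=50, sigma=15)

# P(50 < X < 70) = F(70) - F(50)
p_between = charge_time.cdf(70) - charge_time.cdf(50)

# Equivalent z-limits: z1 = 0, z2 = (70 - 50) / 15
z_upper = (70 - 50) / 15

print(f"z upper = {z_upper:.4f}, P(50 < X < 70) = {p_between:.4f}")
```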
b) The average score on a test is 80 with a standard deviation of 10. With a new
teaching curriculum introduced it is believed that this score will change. On
testing random scores of 38 students, the mean score is 77. At 0.05 significance
level, is there any evidence to support this claim?
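Because the population standard deviation (10) is given and n = 38, a two-tailed z-test is a reasonable reading of this question; that choice is an assumption, since the claim is only that the score "will change". A minimal sketch:

```python
import math
from statistics import NormalDist

mu0 = 80       # historical mean score, H0: mu = 80
sigma = 10     # population standard deviation
n = 38         # sample size
x_bar = 77     # sample mean
alpha = 0.05   # significance level from the question

# z = (x_bar - mu0) / (sigma / sqrt(n))
z_stat = (x_bar - mu0) / (sigma / math.sqrt(n))

# Two-tailed p-value from the standard normal distribution
p_value = 2 * NormalDist().cdf(-abs(z_stat))
reject_h0 = p_value < alpha

print(f"z = {z_stat:.4f}, p = {p_value:.4f}, reject H0: {reject_h0}")
```

Under the two-tailed assumption the p-value stays above 0.05, so the sample does not support the claim that the score has changed.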
Observation A B C D
1 12 12 18 13
2 10 11 12 9
3 12 10 16 12
4 8 14 6 16
5 7 9 12 15
a) What is F-test? If the F statistic is 2.38 and the degrees of freedom obtained
were 8 and 3, find the F value from the F table and determine
whether we can reject the null hypothesis at 5% level of significance (one-tailed
test).
b) Find the simple linear regression equation that fits the given data and
coefficient of determination:
X Y
2 69
3 68
5 82
5 77
6 71
7 84
1. Chi-square distribution
2. Weibull distribution
4. Box Plot
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science
June 13, 2024 02:30 pm - 05:30 pm 1T01875 - T.E. Computer Science & Engineering
(Artificial Intelligence & Machine Learning) (Choice Based) (R-2019 'C' Scheme) SEMESTER - V /
48895 - Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science
QP CODE: 10056538
Duration: 3hrs [Max Marks:80]
N.B. 1. Question No. 1 is compulsory.
2. Attempt any three questions out of remaining five.
3. All questions carry equal marks
4. Assume Suitable data, if required and state it clearly.
1 Attempt any four: 20
(a) Define Confidence Interval?
(b) In a certain property investment company with an international presence,
workers have a mean hourly wage of $12 with a population standard deviation
of $3. Given a sample size of 30, estimate and interpret the SE of the sample
mean.
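The standard error of the sample mean is the population standard deviation divided by the square root of the sample size. A minimal sketch with illustrative names:

```python
import math

sigma = 3   # population standard deviation of hourly wage ($)
n = 30      # sample size

# SE of the sample mean = sigma / sqrt(n)
standard_error = sigma / math.sqrt(n)

print(f"SE = ${standard_error:.4f}")
```

Interpreted loosely, means of repeated samples of 30 workers would typically fall within about 55 cents of the $12 population mean.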
(c) What is hypothesis testing? Explain type I and type II errors?
(d) What do you mean by correlation and regression? Explain with example.
(e) What is analysis of variance? Explain its usage.
2 (a) X is a normally distributed variable with mean μ = 30 and standard deviation σ = 4. Find [10]
a) P(x < 40)
b) P(x > 21)
c) P(30 < x < 35)
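The three probabilities can be checked against a z-table with the standard-library `statistics.NormalDist`. The sketch below uses illustrative variable names.

```python
from statistics import NormalDist

# X ~ Normal(mean=30, sd=4), as given in the question.
X = NormalDist(mu=30, sigma=4)

# a) P(x < 40): z = (40 - 30) / 4 = 2.5
p_a = X.cdf(40)

# b) P(x > 21): z = (21 - 30) / 4 = -2.25
p_b = 1 - X.cdf(21)

# c) P(30 < x < 35): z runs from 0 to 1.25
p_c = X.cdf(35) - X.cdf(30)

print(f"a) {p_a:.4f}  b) {p_b:.4f}  c) {p_c:.4f}")
```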
(b) Some vehicles pass through a junction on a busy road at an average rate of 300 [10]
per hour.
b. What is the expected number passing in two minutes?
c. Find the probability that this expected number found above actually
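Arrivals at a constant average rate are usually modelled with a Poisson distribution, so the sketch below assumes that model. Part (c) is cut off in this copy; the code assumes it asks for the probability that exactly the expected number of vehicles passes in a two-minute window. The helper `poisson_pmf` is illustrative.

```python
import math

RATE_PER_HOUR = 300    # average vehicles per hour, from the question
WINDOW_MINUTES = 2     # length of the observation window

# Expected count in the window: lambda = rate per minute * minutes
expected = RATE_PER_HOUR / 60 * WINDOW_MINUTES


def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam)."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


# Assumed reading of part (c): exactly the expected number passes.
p_exactly_expected = poisson_pmf(int(expected), expected)

print(f"expected = {expected}, P(X = {int(expected)}) = {p_exactly_expected:.4f}")
```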
3 (a) For a certain type of computers, the length of time between charges of the [10]
battery is normally distributed with a mean of 50 hours and a standard
deviation of 15 hours. John owns one of these computers and wants to know
the probability that the length of time will be between 50 and 70 hours.
(b) The average score on a test is 80 with a standard deviation of 10. With a new [10]
random testing, the score of 38 students, the mean was found to be 88. With a
between variables.
b) Given four samples A, B, C, D. Solve using one-way ANOVA to identify any [10]
difference between samples.
Observation A B C D
1 8 12 18 13
2 10 11 12 9
3 12 9 16 12
4 8 14 6 16
5 7 4 8 15
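The one-way ANOVA computation for this table can be sketched as below. The function name `one_way_anova` is illustrative, and the 5% significance level and the critical value F(3, 16) of about 3.24 are assumptions taken from a standard F table, since the question does not state a level.

```python
# Columns of the table: one list per sample.
samples = {
    "A": [8, 10, 12, 8, 7],
    "B": [12, 11, 9, 14, 4],
    "C": [18, 12, 16, 6, 8],
    "D": [13, 9, 12, 16, 15],
}


def one_way_anova(groups):
    """Return (SSB, SSW, df_between, df_within, F) for a list of groups."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    # Between-group sum of squares: group size times squared mean offset
    ssb = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: squared deviations from each group mean
    ssw = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ssb / df_between) / (ssw / df_within)
    return ssb, ssw, df_between, df_within, f_stat


ssb, ssw, df_b, df_w, f_stat = one_way_anova(list(samples.values()))

# Assumption: alpha = 0.05, critical value read from an F table.
F_CRIT_3_16 = 3.24
reject_h0 = f_stat > F_CRIT_3_16

print(f"SSB={ssb:.1f} SSW={ssw:.1f} df=({df_b},{df_w}) F={f_stat:.4f}")
print("reject H0:", reject_h0)
```

Under these assumptions F is below the critical value, so the data do not show a significant difference between the four sample means.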
5 a) What is F-Test? If the F statistic is 2.38 and the degrees of freedom obtained [10]
were 8 and 3, find the F value from the F Table and determine
whether we can reject the null hypothesis at 5% level of significance (one-
tailed test).
b) Find the simple linear regression equation that fits the given data and [10]
coefficient of determination:
X Y
2 69
9 98
5 82
5 77
3 71
7 84
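The least-squares line y = a + b·x and the coefficient of determination can be computed from the sums of squares Sxx, Syy and Sxy. The sketch below is one way to check a hand calculation; the function name `fit_line` is illustrative.

```python
# Data from the table above.
x = [2, 9, 5, 5, 3, 7]
y = [69, 98, 82, 77, 71, 84]


def fit_line(xs, ys):
    """Return (intercept, slope, r_squared) of the least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((a - mean_x) ** 2 for a in xs)
    syy = sum((b - mean_y) ** 2 for b in ys)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # For simple linear regression, R^2 equals the squared correlation.
    r_squared = sxy ** 2 / (sxx * syy)
    return intercept, slope, r_squared


intercept, slope, r_squared = fit_line(x, y)
print(f"y = {intercept:.4f} + {slope:.4f} x, R^2 = {r_squared:.4f}")
```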
Bottles of water have a label stating that the volume is 12 oz. A consumer
group suspects the bottles are under-filled and plans to conduct a test. What
1. Chi-square distribution.
2. Weibull distribution.
4. Box Plot
********************
13/06/2025 TE CSE-AIML SEM-V C-SCHEME SAIDS QP CODE: 10083339
!"#
$%
&'()$&*+,- P[TK3狤狧
.'/)+-))0)1))213)
'/,,0)*-)0,
4'/)5,)671)0)66)*,),-
Q1. Attempt any four [20]
a. What is hypothesis testing? Explain type I and type II errors?
b. What is Fisher's exact test?
c. Explain the difference between Stratified and Cluster Sampling.
d. Explain Linear Regression and its Applications.
e. Define standard deviation and interquartile range with examples.
Q2. a. Find the correlation coefficient from the given data. [10]
w5x)* y+))*)z w,-{
9; |; |I;
H; }; ~I;
; 9H; |;
; 9|; }|;
|; 9}; |;
~; HI; 9I|;
b. What is Chi-Square Test? A retail company wants to determine if there is a [10]
significant association between customer gender and preference for online
shopping vs. in-store shopping. The company collected data from a random
sample of 200 customers and the results are summarized in the following
contingency table. Use the Chi-Square Test for Independence to determine if there
is a statistically significant association between gender and shopping preference at
a 5% significance level.
"+./!&+"%"3+$!!42!#+!/52,2.'+%,&61!&%"73&/+$!"+&//
/!8%+%,&%"9'#.'+!+$!):"#,!-,+$%""12'!1!&
>!"!#$!#,&/.#+!/".8!(,-#,''!*!"+./!&+"+,/!+!1%&!$,?1&(;<=
$,."+$!("2!&/"+./(%&*2!?!!@9!+!-!A.!&#(/%"+%>.+%,&+>'!-,+$!
/+2,8%/!/
<3<33<333<B3<33<<3<3<7373<3<3<3<33<B3<3<<3<33<3
<733<3<3<3<B3<733<<3<33<3<3<3<3<373<B3<3<<3<33<3
<73<3<
2$1#!.+%#'#,12&($"/!8!',2!/&!?/.*+$++$!(#'%1',?!" ;<=
>',,/2!"".!1,!!--!#+%8!'(+$&+$!#. !&+"+&///.*0$!8!*!
!/.#+%,&%&>',,/2!"".!-,2+%!&+"."%&*+$!"+&///.*%"<11C*3?%+$
"+&///!8%+%,&,-11C*0$!#,12&(#,&/.#+"#'%&%#'+%'?%+$B
2+%!&+"."%&*+$!&!?/.*&/,>"!8!"&8!*!!/.#+%,&,-<11C*+
"%*&%-%#&#!'!8!'3&"?!+$!-,'',?%&*D
<EF++!+$!&.''&/'+!&+%8!$(2,+$!"!"
E9'#.'+!+$!+!"+"++%"+%#
G!+!1%&!%-+$!&!?/.*%""++%"+%#''("%*&%-%#&+'(1,!!--!#+%8!+$&+$!
"+&///.*
>H%&/+$!"%12'!'%&!!*!""%,&!A.+%,&-,+$!*%8!&/+ ;<=
IJK LMNO
B <
7 <
< B
<
<
Explain the concept of two-way ANOVA. How does it differ from one-way [10]
ANOVA? Describe the assumptions of two-way ANOVA and how you would
check these assumptions. Also, briefly explain Friedman's test as a
non-parametric alternative.
b. Write short notes on (any two) [10]
1. Chi-squared distribution
2. Weibull distribution
3. Stem & Leaf Plot
4. Box Plot
[[[[[[[[[[[[[[[
22/11/2024 CSE-AIML SEM-V C SCHEME DLOC-STAT. FOR AIDS QP CODE: 10067675
7955
8!"#!$%&'()#%*+!
2. Attempt any three from the remaining five questions
3. Assume suitable data, if required and state it clearly
!
8!-../&'.!01+!;!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! ! ! !!!! <!
a) Explain percentiles and Boxplots with example
b) Illustrate central limit theorem with a neat diagram
c) Explain Null and alternative Hypothesis with example
d) What is Histogram? Give its applications
e) How to detect outliers?
3D20.!"#!.2/!GHI/#.!(#/9!3%*F!
!
<! ! ! !! !!! ! ! ! ! ! ! 8!
a) What is Chi-Square Test? A study is conducted to examine the relationship between
gender and whether a person prefers coffee or tea. The data collected is as follows:
M7NO7P4777NQ7R9 77759S5T
5T7! 7U! U!;U!
V75T7! ,U! 7U!WU!
9S5T! WU! ;U!XU!
Y#/!$2"H#6(0*/!./#.!.%!3"19!0##%$"0."%1!8/.@//1!4/19/*!019!8/5/*04/!'*/3/*/1$/F!ZY#/!
[\]^]_`^!
!
8=>')0"1!B%*&0)!019!a%"##%1!b"#.*"8(."%1!c%#.!4*09(0./!#$2%%)#!%3!8(#"1/##!*/6("*/!
0'')"$01.#!3%*!09&"##"%1!.%!.0d/!.2/!E*09(0./!c0104/&/1.!-9&"##"%1!J%(1$")e#!
Ec-I!/>0&"10."%1!K$%*/#!%1!.2/!Ec-I!0*/!*%(42)+!1%*&0))+!9"#.*"8(./9!@".2!0!
&/01!%3!W,f!019!0!#.0190*9!9/5"0."%1!%3!,!I20.!"#!.2/!'*%808")".+!%3!01!"19"5"9(0)!
#$%*"14!08%5/!WUU!%1!.2/!Ec-IF!C%@!2"42!&(#.!01!"19"5"9(0)!#$%*/!%1!.2/!Ec-I!
"1!%*9/*!.%!#$%*/!"1!.2/!2"42/#.!WgF!!!!!! ! ! ! ! 8!
!
! ! ! ! !! ! ! ! ! ! ! 8!
a) Explain single and Multiple linear regression with example and show with suitable
plot
!
8=>')0"1!.Hb"#.*"8(."%1!"1!9/.0")!I2/!J=h!%3!)"42.!8()8#!&01(30$.(*"14!$%&'01+!
$)0"&#!.20.!0!)"42.!8()8!)0#.#!7UU!90+#!-!*/#/0*$2/*!*019%&)+!#/)/$.#!W!8()8#!./#."14!
I2/!#0&')/9!8()8#!)0#.!01!05/*04/!.201!,XU!90+#F!9/5"0."%1!%3!WU!90+#!A3!.2/!J=he#!
$)0"&!@/*/!.*(/:!@20.!"#!.2/!'*%808")".+!.20.!W!*019%&)+!#/)/$./9!8()8#!@%()9!205/!
01!05/*04/!)"3/!%3!1%!&%*/!.201!,XU!90+#F!!!! ! ! ! 8!
!
i!! ! ! !!!!!!!!!!!! ! ! ! ! ! 8!
a) Explain briefly why use ANOVA? Give difference between one-way and two-way
ANOVA test. Solve the following using one-way ANOVA
!
!
!
!
!"#$%&# '()(
*&#&&+!"%,$ -!."/01)(2
3456786968:7;
"<=>?@>=AB6ACB6D54E=EF>FAG6AC=A6H6>FBI6EBAJBBK6L6=KM6NLO6F;B;O6PQL8H8NLR;
"&+&%&0(22!%S0("
"&+&%,&0T22!%U0T" VWX
Y0Z& [WX
*+*+\\
],#^+'&.#+
_++,`ab#+,
Z
bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science
1T01865 - T.E. Computer Science & Engineering (Data Science) (Choice Based) (R-2019-20 'C' Scheme) SEMESTER - V / 48895 -
Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science
QP CODE: 10014523 DATE: 02/12/2022
[Time: 3 Hours] [Marks: 80]
N.B. 1. Question No. 1 is compulsory.
2. Attempt any three questions out of remaining five.
3. All questions carry equal marks
4. Assume Suitable data, if required and state it clearly.
Q.1 Attempt any four: 20
a) Find the standard deviation of the average temperatures recorded over a
five-day period last winter: 19, 21, 18, 24, 12?
b) X is a normally distributed variable with mean μ = 30 and standard deviation
σ = 4. Find:
i) P (x < 40), ii) P (30 < x < 35)?
c) Discuss bootstrapping vs. re-sampling
d) The school principal wants to test if it is true what teachers say – that high
school juniors use the computer an average 3.2 hours a day. What are our
null and alternative hypotheses?
e) What do you mean by correlation and regression? Explain with example
Q.2 a) Find the value of the correlation coefficient from the data given in the [10]
following table:
1 43 99
2 21 65
3 25 79
4 42 75
5 57 87
6 59 81
b) [10]
method:
Q.3 a) (i) Explain type I & type II error in detail. [10]
(ii) What is the use of scatter plot and box plot?
b) In a manufacturing unit, four teams of operators were randomly selected and [10]
sent to four different facilities for machining techniques training. After the
training, the supervisor conducted the exam and recorded the test scores. At
95% confidence level, are the scores the same in all four facilities?
(Hint: Use Kruskal–Wallis test)
Q.4 a) If the sample mean and expected mean value of the marks obtained by 15 [10]
students in a class test are 290 and 300 respectively, what is the t-score if the
standard deviation of the marks is 50?
b) Find out what is the relation between the GPA of a class of students and the [10]
number of hours of study and the height of the student
Q.5 a) A farmer is trying out a planting technique that he hopes will increase the [10]
yield on his pea plants. The average number of pods on one of his pea plants
is 145 pods with a standard deviation of 100 pods. This year, after trying his
new planting technique, he takes a random sample of his plants and finds the
average number of pods to be 147. He wonders whether this is a statistically
significant increase. What are his hypotheses and the test statistic? Use a
0.05 significance level.
b) Find the simple linear regression equation that fits the given data and [10]
coefficient of determination:
Hour Temp
2 21
4 27
6 29
8 86
10 86
12 92
Q.6 a) An agent sells life insurance policies to five equally aged, healthy people. [10]
According to recent data, the probability of a person living in these
conditions for 30 years or more is 2/3. Calculate the probability that after 30
years:
i. All five people are still living.
ii. At least three people are still living.
iii. Exactly two people are still living. (Hint: Binomial Distribution)
b) Write short notes on (any two) [10]
i. Confidence Interval
ii. Central Limit Theorem
iii. Standard Error
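Following the hint in Q.6 a), the number of survivors can be modelled as Binomial(n = 5, p = 2/3), assuming the five lives are independent. The sketch below computes the three requested probabilities; the helper `binom_pmf` is illustrative.

```python
import math

N_PEOPLE = 5     # policies sold
P_SURVIVE = 2 / 3  # probability one person lives 30 more years


def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)


# i. All five still living
p_all_five = binom_pmf(5, N_PEOPLE, P_SURVIVE)

# ii. At least three still living: P(X=3) + P(X=4) + P(X=5)
p_at_least_three = sum(binom_pmf(k, N_PEOPLE, P_SURVIVE) for k in range(3, 6))

# iii. Exactly two still living
p_exactly_two = binom_pmf(2, N_PEOPLE, P_SURVIVE)

print(f"i) {p_all_five:.4f}  ii) {p_at_least_three:.4f}  iii) {p_exactly_two:.4f}")
```

As exact fractions these are 32/243, 192/243 and 40/243 respectively.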