0% found this document useful (0 votes)
45 views20 pages

SAIDS Mu Ques Paper Merged

MU ques papers

Uploaded by

Manasvi Devlekar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
45 views20 pages

SAIDS Mu Ques Paper Merged

MU ques papers

Uploaded by

Manasvi Devlekar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

1

3A
2
32

8A
68

D4
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

3B

C4
63

AB
F9
45

42
32

A
68

43
2E

9D

B8
1T01875 - T.E. Computer Science & Enginering (Artificial Intelligence & Machine Learning) (Choice Based)

3B

63

2C
96

3A
45

32
(R-2019 'C' Scheme) SEMESTER - V / 48895 - Department Optional Course - 1: Statistics for Artificial

68

4
AE

2E

D
B

4
63

2C
F9
Intelligence & Data Science QP CODE: 10039067 DATE: 04/12/2023

3
6
C2

45

32
9

4
Duration: 3hrs [Max Marks:80]

36
15

D
B
A

62

C
F9
3

26
8A

42
9
C

68
E4

B3
AE
AB

15

D
*KI3狤狥

63
2

F9
53
8A

96
(1) Question No 1 is Compulsory.

2
43

32
5C

8
E4
E
B
2C

36
(2) Attempt any three questions out of the remaining five.

3B
2A
1
3A

62

26
A
D4

45
(3) All questions carry equal marks.

9
C
8
C4

B3
AE
B

36
5

2E
F9

A1
(4) Assume suitable data, if required and state it clearly.

A
2

53

26
68

96
D4

C2
43

B8

E4

3
63

AE
C

15
9

3B
F

3A

62
42
32

8A
68

C2

45
1 Attempt any four [20]

E9
D
3B

C4
3

2E
F9
6

2A
Write a short note on hypothesis testing.

A1
a)

A
45

42
32

96
43
36
2E

C
8
b) What is Fisher's exact test?
3B

4
AE
AB
2C

15

2E
9
26
96

8F
5

c) Write a short note Simple Linear Regression

8A
4

96
D4

C2
3

3
AE

6
2E

C4
d) Write a short note on Random sampling

63

AE
AB

5
F9
3
6
C2

A1
45

2
32
e) What is the empirical CDF function?
9

C2
43
E

6
15

B8
B
2A

3
62

2C

15
9
3

6
8A

8F

3A
45

2
E9
5C

8A
2 a) Construct a frequency distribution table for the following weights (in gm) of 30 [10]

4
3
AB

36
E

9D
3B

C4
A
A1

AB
62

oranges using the equal class intervals, one of them is 40-45 (45 not included).

26
2

F
43

42
9
5C
B8

68
E4
The weights are: 31, 41, 46, 33, 44, 51, 56, 63, 71, 71, 62, 63, 54, 53, 51, 43,

43
E
2C

9D
3B
2A

63
1
3A

62

2C
36, 38, 54, 56, 66, 71, 74, 75, 46, 47, 59, 60, 61, 63.
A
D4

8F
45

32
9
C
8
C4

D4
E
B

36
5

E
F9

3B
2A
A1
3A

2
42

F9
6
(a) What is the class mark of the class intervals 50-55?
68

45

32
E9
9D

5C
B8
C4

68
63

(b) What is the range of the above weights?


2E

B
A

63
A1
8F

3A
42
32

53
6
C2

(c) How many class intervals are there?

32
36

9
9D

B8
3B

C4

E4
AE
5

(d) Which class interval has the lowest frequency?

3B
26

A1
8F

A
45

62
42

2
B3

43

45
b) What is the primary purpose of conducting a one-way ANOVA. Explain the [10]
36
2E

E9
9D

C
8
AB
2C

2E
53

26

key components of a one-way ANOVA, including the dependent variable,


96

2A
1
8F

8A
E4

96
4
B3

43
AE

independent variable, and factors.


36

5C

AE
AB
62

C
F9
53

26
C2

A1
2
E9

8
E4

D4

C2
B3

43
6

B8
2A

3
62

2C

15
9
53

8F

3A

3 a) Find the standard error of the estimate for the average number of children in a [10]
2
E9
5C

8A
E4

4
B3

36

9D

C4

household in your city by using the data collected from a sample of households
2A
A1

AB
62

53

8F

42
2

in your city. Then find a 95% confidence interval for the data.
E9
5C
B8

E4

43
36

9D
3B
2A
A1
3A

62

2C
26

8F
5
E9
5C
B8

Household No. of children


4

D4
3

36
E

3B
2A
A1
3A

62

1 2
F9
26
45
E9
5C
B8
C4

68
B3

2 3
E
2A

63
A1
3A

2
42

53

3 1
6

32
9
D

5C
B8
C4

E4
AE

4 0
F9

3B
A1
3A

62
42

5 5
45
E9
9D

5C
B8
C4

2E

6 2
2A
A1
8F

3A
42

96

7 1
36

C
B8
C4

AE
5
F9
26

A1

8 4
3A
42
68

C2
B3

9D

B8
C4

b) What is the concept of correlation in statistics, how is it different from [10]


63

15
8F

3A
42

regression?
32

8A
36

D
3B

C4

AB
F9
26

39067 Page 1 of 2
45

42
68
B3

43
2E

D
63

2C
F9
53
96

32

68
E4

D4
3B

63
62

F9
45

2AE962E453B326368F9D42C43AB8A15C
32
E9

68
1
F

3A
2
32

8A
68

D4
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

3B

C4
63

AB
F9
45

42
32

A
68

43
2E

9D

B8
3B

63

2C
96

3A
45

32

68

4
AE

2E

D
B

4
63

2C
F9
3
6
C2

45

32
9

4
E
4 a) A radar unit is used to measure speeds of cars on a motorway. The speeds are [10]

36
15

D
B
A

62

C
F9
3

26
8A
normally distributed with a mean of 90 km/hr and a standard deviation of 10

42
9
C

68
E4

B3
AE
AB

15
km/hr. What is the probability that a car picked at random is travelling at more

D
63
2

F9
53
8A

96
2
than 100 km/hr?

43

32
5C

8
E4
E
B
2C

36
3B
b) Explain Numerical and Categorical data types with appropriate examples [10]

2A
1
3A

62

26
A
D4

45
9
C
8
C4

B3
AE
B

36
5

2E
F9

A1
A
2

53

26
68

96
D4

C2
43
5 a) Duracell manufactures batteries that the CEO claims will last an average of 300 [10]

B8

E4

3
63

AE
C

15
9

3B
F
hours under normal use. A researcher randomly selected 20 batteries from the

3A

62
42
32

8A
68

C2

45
E9
D
3B

C4
production line and tested these batteries. The tested batteries had a mean life

2E
F9
6

2A
A1
A
45

42
span of 270 hours with a standard deviation of 50 hours. Do we have enough
32

96
43
36
2E

C
8
3B

4
evidence to suggest that the claim of an average lifetime of 300 hours is false?

AE
AB
2C

15

2E
9
26
96

8F
5

b) Explain linear least square regression (LLSR) along with it’s advantages and [10]

8A
4

96
D4

C2
3

3
AE

6
2E

C4
63

AE
disadvantages.

AB

5
F9
3
6
C2

A1
45

2
32
9

C2
43
E

6
15

B8
B
2A

3
62

2C

15
9
3

6
8A

8F

3A
45

2
E9
5C

8A
4
6 a) A farmer is trying out a planting technique that he hopes will increase the yield [10]
3
AB

36
E

9D
3B

C4
A
A1

AB
62

on his pea plants. The average number of pods on one of his pea plantsis 145

26
2

F
43

42
9
5C
B8

68
E4

43
E

pods with a standard deviation of 100 pods. This year, after trying his new
2C

9D
3B
2A

63
1
3A

62

2C
A

planting technique, he takes a random sample of his plants and finds theaverage
D4

8F
45

32
9
C
8
C4

D4
E
B

number of pods to be 147. He wonders whether or not this is a statistically

36
5

E
F9

3B
2A
A1
3A

2
42

F9
6
68

significant increase. What are his hypotheses and the test statistic? 45

32
E9
9D

5C
B8
C4

68
63

2E
What is the Chi-Square Test in statistics, and in what kind of situations or

B
b) [10]
A

63
A1
8F

3A
42
32

53
6
C2

research scenarios is it commonly used?

32
36

9
9D

B8
3B

C4

E4
AE
5

3B
26

A1
8F

A
45

62
42

2
B3

43

45
36
2E

E9
9D

C
8
AB
2C

2E
53

26
96

2A
1
8F

8A
E4

96
4
B3

43
AE

36

5C

*********************
AE
AB
62

C
F9
53

26
C2

A1
2
E9

8
E4

D4

C2
B3

43
6

B8
2A

3
62

2C

15
9
53

8F

3A
2
E9
5C

8A
E4

4
B3

36

9D

C4
2A
A1

AB
62

53

8F

42
2
E9
5C
B8

E4

43
36

9D
3B
2A
A1
3A

62

2C
26

8F
5
E9
5C
B8

D4
3

36
E

3B
2A
A1
3A

62

F9
26
45
E9
5C
B8
C4

68
B3
E
2A

63
A1
3A

2
42

53
6

32
9
D

5C
B8
C4

E4
AE
F9

3B
A1
3A

62
42

45
E9
9D

5C
B8
C4

2E
2A
A1
8F

3A
42

96
36

C
B8
C4

AE
5
F9
26

A1
3A
42
68

C2
B3

9D

B8
C4
63

15
8F

3A
42
32

8A
36

D
3B

C4

AB
F9
26

39067 Page 2 of 2
45

42
68
B3

43
2E

D
63

2C
F9
53
96

32

68
E4

D4
3B

63
62

F9
45

2AE962E453B326368F9D42C43AB8A15C
32
E9

68
Paper / Subject Code: 48895 / Department Optional Course - I: Statistics for
Artificial Intelligence & Data Science

T10I885 - T.E. Computer Science & Engineering (Data Science) (Choice Based)
(R-2019-20’C Scheme) SEMESTER - V / 48895 - Department Optional Course -
I: Statistics for Artificial Intelligence & Data Science
QP CODE: 10014523
Date: 02/12/2022
[Marks: 80]
[Time: 3 Hours]

Q.1 (Attempt any four) (20 Marks)

a) Find the standard deviation of the average temperatures recorded over a five-
day period last winter: 19, 21, 18, 24, 12?

b) XX is a normally distributed variable with mean μ = 30 and standard deviation


σ = 4. Find:
i. P(X<40)P(X < 40)
ii. P(30<X<35)P(30 < X < 35)

c) Discuss Bootstrapping vs. re-sampling.

d) The school principal wants to test if it is true what teachers say - that high
school juniors use the computer an average 3.2 hours a day. What are null and
alternative hypotheses?

e) What do you mean by correlation and regression? Explain with example.

Q.2 (10 Marks)

a) Find the value of the correlation coefficient from the data given in the
following table:

SUBJECT AGE (X) GLUCOSE LEVEL (Y)

1 43 99

2 21 65

3 25 79

4 42 75

5 57 87

6 59 81

b) Explain briefly why ANOVA is used? Solve using One-way ANOVA:


OBSERVATIONS A B C

1 25 20 18

2 30 28 36

3 28 30 34

4 38 35 22

5 31 35 28

Q.3 (10 Marks)

a) Explain type I & type II error in detail.


(i) What is the use of scatter plot and box plot?

b) In a manufacturing unit, four teams of operators were randomly selected and


sent to four different facilities for machining techniques training. After the
training, the supervisor conducted the exam and recorded the test scores. At
95% confidence level does the scores are same in all four facilities?
(Hint: Use Kruskal-Wallis test)

Facility 1 Facility 2 Facility 3 Facility 4

88 77 71 69

82 76 56 65

86 84 64 68

87 59 51 81

Q.4 (10 Marks)

a) If the sample mean and expected mean value of the marks obtained by 15
students in a class test is 290 and 300 respectively. What is the t-score if the
standard deviation of the marks is 50?

b) Find out what is the relation between the GPA of a class of students and the
number of hours of study and the height of the student:

GPA Height Study Hours

2.9 66 7

3.0 65 8
GPA Height Study Hours

3.62 67 7

3.2 62.5 7

3.1 64 8

2.8 63 6

3.63 68 9

3.84 65 6

3.93 69 10

3.76 64 7

2.75 59 4

Q.5 (10 Marks)

a) A farmer is trying out a planting technique that he hopes will increase the
yield on his pea plants. The average number of pods on one of his pea plants is
145 pods with a standard deviation of 100 pods. This year, after trying his new
planting technique, he takes a random sample of his plants and finds the
average number of pods to be 147. He wonders whether this is a statistically
significant increase. What are his hypotheses and the test statistic? Use a 0.05
significance level.

b) Find the simple linear regression equation that fits the given data and
coefficient of determination:

Hour Temp

1 21

2 27

4 25

8 86

10 92

12 96

Q.6 (10 Marks)


a) An agent sells life insurance policies to five equally aged, healthy people.
According to recent data, the probability of a person living in these conditions for
30 years or more is 2/3. Calculate the probability that after 30 years if:
i. All five people are still living.
ii. At least three people are still living.
iii. Exactly two people are still living.
(Hint: Binomial Distribution)

b) Write short notes on (any two):


i. Confidence Interval
ii. Central Limit Theorem
iii. Standard Error

Paper 1

Paper / Subject Code: 48895 / Department Optional Course - I: Statistics for


Artificial Intelligence & Machine Learning
QP CODE: 1003967
Date: 02/12/2023
[Max Marks: 80]

Q.1 (20 Marks) (Attempt any four)

a) Write a short note on hypothesis testing.


b) What is Fisher’s exact test?
c) Write a short note on Simple Linear Regression.
d) Write a short note on Random sampling.
e) What is the empirical CDF function?

Q.2 (10 Marks)

a) Construct a frequency distribution table for the following weights (in gm) of
30 oranges using equal class intervals, one of them is 40-45.
Weights: 31, 41, 46, 33, 44, 51, 56, 61, 71, 63, 52, 64, 53, 51, 43, 36, 38, 54,
56, 66, 71, 74, 45, 47, 60, 59, 63, 61, 63, 60

(i) What is the class mark of the class interval 50-55?


(ii) What is the range of the above weights?
(iii) How many class intervals are there?
(iv) Which class interval has the lowest frequency?

b) What is the primary purpose of conducting a one-way ANOVA? Explain the


key components of a one-way ANOVA, including the dependent variable,
independent variable, and factors.

Q.3 (10 Marks)


a) Find the standard error of the estimator for the average number of children in
a household in your city using the data collected from a sample of households in
your city. Then find a 95% confidence interval for the data.

Household No. of Children

1 2

2 1

3 2

4 2

5 1

6 2

7 4

8 2

9 1

10 2

b) What is the concept of correlation in statistics, how is it different from


regression?

Q.4 (10 Marks)

a) A radar unit is used to measure speeds of cars on a motorway. The speeds


are normally distributed with mean = 90 km/hr and standard deviation = 10
km/hr. What is the probability that a car picked at random is travelling at more
than 100 km/hr?

b) Explain Numerical and Categorical data types with appropriate examples.

Q.5 (10 Marks)

a) Duracell manufactures batteries that the CEO claims will last an average of
300 hours under normal use. A researcher randomly selected 20 batteries from
the production line and tested these batteries. The tested batteries had a mean
life span of 270 hours with a standard deviation of 50 hours. Do we have enough
evidence to suggest that the claim of an average lifetime of 300 hours is false?

b) Explain Linear Least Square Regression (LLSR) along with its advantages and
disadvantages.
Q.6 (10 Marks)

a) A farmer is trying out a planting technique that he hopes will increase the
yield on his pea plants. The average number of pods on one of his pea plants is
145 pods with a standard deviation of 100 pods. This year, after trying his new
planting technique, he takes a random sample of his plants and finds the
average number of pods to be 147. He wonders whether this is a statistically
significant increase. What are his hypotheses and the test statistic?

b) What is the Chi-Square test in statistics, and in what kind of situations or


research scenarios is it commonly used?

Paper 2

Paper / Subject Code: 48895 / Department Optional Course - I: Statistics for


Artificial Intelligence & Data Science
QP CODE: 10056538
Date: 13/06/2024
[Max Marks: 80]

Q.1 (20 Marks) (Attempt any four)

a) Define Confidence Interval.


b) A certain property investment company with an international presence,
workers have a mean hourly wage of $12 with population standard deviation of
$3. Given a sample size of 30, estimate and interpret the SE of the sample
mean.
c) What is hypothesis testing? Explain type I and type II errors?
d) What do you mean by correlation and regression? Explain with example.
e) What is an analysis of variance? Explain its usage.

Q.2 (10 Marks)

a) XX is a normally distributed variable with mean μ = 30 and standard deviation


σ = 4. Find:
i. P(X<40)P(X < 40)
ii. P(X>21)P(X > 21)
iii. P(30<X<35)P(30 < X < 35)

b) Some vehicles pass through a junction on a busy road at an average rate of


300 per hour.
i. Find the probability that none passes in a given minute.
ii. What is the expected number of passing in two minutes?
iii. Find the probability that this expected number found above actually passes
through in a given two-minute period.
Q.3 (10 Marks)

a) For a certain type of computers, the length of time between charges of the
battery is normally distributed with a mean of 50 hours and a standard deviation
of 15 hours. John owns one of these computers and he wants to know the
probability that the length of time will be between 50 and 70 hours.

b) The average score on a test is 80 with a standard deviation of 10. With a new
teaching curriculum introduced it is believed that this score will change. On
testing random scores of 38 students, the mean score is 77. At 0.05 significance
level, is there any evidence to support this claim?

Q.4 (10 Marks)

a) Explain QQ plots in detail. Show how scatterplots explore relationships


between variables.

b) Given four samples A, B, C, D. Solve using one-way ANOVA to identify any


difference between samples.

Observation A B C D

1 12 12 18 13

2 10 11 12 9

3 12 10 16 12

4 8 14 6 16

5 7 9 12 15

Q.5 (10 Marks)

a) What is F-test? If the F statistic as 2.38 and the degrees of freedom obtained
by him were 8 and 3. Find out the F value from the F table and determine
whether we can reject the null hypothesis at 5% level of significance (one-tailed
test).

b) Find the simple linear regression equation that fits the given data and
coefficient of determination:

XY

2 69

3 68
XY

5 82

5 77

6 71

7 84

Q.6 (10 Marks)

a) Explain Binomial distribution in detail. Bottles of water have a label stating


that the volume is 12 oz. A consumer group suspects the bottles are under-filled
and plans to conduct a test. What would a Type I error in this situation mean?

b) Write short notes on (any two):

1. Chi-square distribution

2. Weibull distribution

3. Stem & Leaf Plot

4. Box Plot
Y
X

0X
10

10
25
F0
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

AF
YF

YF
X5
0A

10
25

25
June 13, 2024 02:30 pm - 05:30 pm 1T01875 - T.E. Computer Science & Enginering

F0

F0
F1

YF
X5

X5
0A
(Artificial Intelligence & Machine Learning) (Choice Based) (R-2019 'C' Scheme) SEMESTER - V /

0A
5Y

25
F0

F0
48895 - Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

1
2

YF

YF
X5

X5
0A

0A
QP CODE: 10056538

25
0

F0
F1

1
2
F

YF
5

X5
0A
Duration: 3hrs [Max Marks:80]

0A
5Y

0X

5
0
F1

1
52

52
AF

AF
YF
X5
5Y
0X

0X
N.B. 1. Question No. 1 is compulsory.

10

10
5
F0
52

2
AF

AF
F
2. Attempt any three questions out of remaining five.

YF
X5
0A
5Y
X
10

0
5
0
3. All questions carry equal marks

F0
F1

1
2

2
F
F

YF
X5

X5
A

0A
5Y

Y
4. Assume Suitable data, if required and state it clearly.

10

25

5
0

F0
1
52

1
2
F
F

YF

F
5

X5
0A

0A
5Y
0X

5Y
0X
1 Attempt any four: 20

F0
1

1
52

2
AF

2
F
F

YF
X5
(a) Define Confidence Interval?

X5
0A

0A
5Y
0X
10

5
0

F0
(b) In a certain property investment company with an international presence,

F1
52

2
AF

F
F

X5
A
5Y

0A
5Y

Y
0X
workers have a mean hourly wage of $12 with a population standard deviation

10

10

25
0

0
52

1
52
AF

F
F

AF
of $3. Given a sample size of 30, estimate and interpret the SE of the sample

YF

YF
X5
0A
5Y
0X

0X
0

10
mean.

25

25
F0
1

F1
52
AF

F
F

YF
5

X5
A

0A
(c) What is hypothesis testing? Explain type I and type II errors?
5Y

5Y
0X

0X
10

25
0
1

F1
52
(d) What do you mean by correlation and regression? Explain with example.

52
AF

F
YF

F
F

X5
0A

A
Y

5Y
0X

0X
(e) What is analysis of variance? Explain its usage.
10
25

10
25

F0
1

2
F

AF
F

F
X5

YF
X5

X5
(a) X is a normally distributed variable with mean μ = 30 and standard deviation 10
0A
5Y

0A
2
5Y

10
F0

25
0

0
F1

σ = 4. Find
52

F1
52
AF

F
F
0A

X5
0A
5Y

5Y
0X

5Y
0X
a) P(x < 40)
10
F1

F0
1
52

52
AF

52
F
YF

F
b) P(x > 21)
A
5Y

0A
Y
0X

0X

0X
10

10
25

25
c) P(30 < x < 35)
52

F1
AF

F
YF

AF
YF
X5

X5
0A
0X

5Y
(b)
10. Some vehicles pass through a junction on a busy road at an average rate of 300 10
10
25

10
5
F0

F0
1
52
AF

52
per hour.
YF

F
X5

YF
0A

0A
Y
0X

0X
10

25

a. Find out the probability that none passes in a given minute.


5
F0

25
F1

F1
2
AF
YF

F
X5

5
0A

X5
b. What is the expected number of passing in two minutes?
A
5Y

5Y
X
10
25

10
F0

F0
F1

c. Find the probability that this expected number found above actually F0
52

52
YF
X5

YF
0A

0A
5Y

0A
0X

0X

pass through in a given two-minute period.


25
F0

25
F1

1
52

F1
AF

AF
YF
X5
0A

X5
5Y
0X

5Y
10

10
25
F0

3 (a) For a certain type of computers, the length of time between charges of the 10
F1

F0
52
AF

52
YF

F
X5
0A
5Y

0A
Y
0X

battery is normally distributed with a mean of 50 hours and a standard


0X
10

25

25
0
F1
52

F1
AF

AF

deviation of 15 hours. John owns one of these computers and wants to know
YF

AF
X5

X5
5Y
0X

5Y
10

10

the probability that the length of time will be between 50 and 70 hours.
25

10
F0

0
52
AF

52
AF
YF

YF
X5

YF
0A

0X

0X
10
25

25
F0

25
F1

(b) The average score on a test is 80 with a standard deviation of 10. With a new 10
AF

AF
YF
X5

X5
0A

X5
5Y

teaching curriculum introduced it is believed that this score will change. On


10

10
25
F0

F0
F1

F0
2

YF

YF

random testing, the score of 38 students, the mean was found to be 88. With a
X5

X5
0A

0A
5Y

0A
25

25
F0

0.05 significance level, is there any evidence to support this claim?


F0
F1

F1
2

F1
X5

X5

X5
0A

0A
5Y

5Y

4 a) Explain QQ plots in detail. Show how scatterplots explores relationships 10


5Y
0

F0
F1

F1
52

52
AF

between variables.
X5

0A
5Y

5Y
0X

0X
10

F0

F1
52

52
AF

AF
YF

0A

5Y
0X

0X
10

10
25

2
AF

AF
YF

YF

YF
X5

X5
10

10
25

25
F0

F0
YF

YF
X5

X5
0A

0A
25

25
F0

0
F1

F1

AF
X5

X5
0A
5Y

5Y

10
F0

F0
F1
52

YF
X5
0A

0A
5Y
0X

25
0
F1
2
AF

AF
X5

X5
5Y
10

10
F0

F0
52
YF

YF
0A

0A

56538 Page 1 of 2
0X
25

25
F1

F1
AF
X5

X5
5Y

5Y
10

0
52

52
AF
YF
0X

X525YF10AF0X525YF10AF0X525YF10AF0X525YF10AF0
0X
10
25
Y
X

0X
10

10
25
F0
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

AF
YF

YF
X5
0A

10
25

25
F0

F0
F1

YF
X5

X5
0A

0A
5Y

25
F0

F0
1

1
2

YF

YF
X5

X5
0A

0A
5
b) Given four samples A, B, C, D. Solve using one-way ANOVA to identify any 10

25
0

F0
F1

1
2
F

YF
5

X5
0A

0A
difference between samples.

5Y

0X

5
0
F1

1
52

52
AF

AF
YF
Observation A B C D

X5
5Y
0X

0X
10

10
5
0
52

2
AF

AF
F

YF
1 8 12 18 13

X5
0A
5Y
X
10

10
5
0

F0
F1
2

2
F
YF
2 10 11 12 9

YF
X5

X5
A

0A
Y
10
25

25

25
0

F0
F1

F1
F
YF
3 12 9 16 12

X5

X5
A

0A
5Y

5Y
0X
0
25
F0

F0
1

1
2

52
F
F
4 8 14 6 16

YF
X5

X5
0A

0A

0A
5Y

0X
5
F0

0
F1

1
2

2
F

AF
F
5 7 4 8 15

YF
5

X5
A

A
5Y

5Y
0X
0

10

10
5
0
1

F0
52

5 a) What is F-Test? If the F statistic as 2.38 and the degrees of freedom obtained 10

52

2
AF

F
F

YF

YF
X5
0A
5Y

0A
0X

0X
by him were 8 and 3. Find out the F value from the F Table and determine

25

25
0
1

F1
2

F1
AF

F
YF
5

whether we can reject the null hypothesis at 5% level of significance (one-

X5
A

0A
5Y
X

5Y
0X
0

0
5
0

tailed test).
F1

F0
1

F1
2

52
F

52
F
F
X5
A

0A
5Y

0A
Y

5Y
0X

0X
10

b) Find the simple linear regression equation that fits the given data and 25 10
F0

1
52

F1
2
AF
F

AF
F
X5

X5
A

coefficient of determination:
5Y

5Y
0X

5Y
10

10

10
F0

0
52

52
AF

X Y

2
F
F

YF
X5
A

0A
Y

5Y
0X

0X
10

2 69
5

25
0
1

1
2

2
AF

F
YF

F
YF

YF
5

X5
A

0A
9 98
X

0X
10

10
25

25
0

F0
1
2
AF

5 82
F
YF

YF
X5

F
5

X5
A

0A
5Y
0X
10

0
25

5 77
5
0

F0
1

F1
2
AF

52
AF
YF

F
X5

X5

0A

3 71
Y

5Y
0X
0

10
25

5
0

F0
F1

1
2
AF

52
7 84
AF
F

F
X5

5
0A
Y

5Y
X

0X
0
25

10
5
0

F0
F1

F1
2

2
AF

AF
X5

YF
X5

5
0A
Y

5Y

0X
0
25

10
F0

25
0
F1

6 a) Explain Binomial distribution in detail. 10


2
AF

AF
YF
X5

YF
5
0A

5
5Y

0X

Bottles of water have a label stating that the volume is 12 oz. A consumer
0

10
5
F0

25
0
F1

F1
52

2
AF

AF
F

group suspects the bottles are under‐filled and plans to conduct a test. What
X5
0A

X5
5Y

5Y

Y
0X

10

10
25
0

would a Type I error in this situation mean?


F1

F0
52

52
AF

F
YF

F
X5
A
5Y

0A
0X

b) Write short notes on (any two) 10


0X
10

10
25

5
F0
52

F1
AF

2
AF

1. Chi-square distribution.
YF

YF
X5

5
0A
0X

5Y
0X
10

2. Weibull distribution.
25

25
F0

F1
AF

52
AF
YF
X5

3. Stem & Leaf Plot


0A

5Y
X

0X
10

10
25
F0

0
F1

4. Box Plot
52
AF
YF

AF
YF
X5
0A

5Y

0X
10
25

10
5
F0
F1

52

2
AF
YF
X5

YF
X5
0A
5Y

0X

10
25

25
0
F1
52

AF

F
YF
X5

X5
0A
5Y
0X

10

25
F0

F0
1
52
AF

YF

YF
5
0A

0A
0X

0X
10

********************
25

25
F1

F1
AF

AF
YF

X5

5
5Y

5Y
X
10

10
F0

0
52

52
AF
YF

YF
0A

0X

0X
0
25

25
F1

1
AF

AF
YF
X5

X5
5Y

10

10
25
F0

F0
52

YF

YF
X5
0A

0A
0X

25

25
F0
F1
AF

X5

X5
0A
5Y
10

F0

F0
F1
52
YF

0A

0A

56538 Page 2 of 2
5Y
0X
25

F1

F1
52
AF
X5

5Y

5Y
0X
10
52

52
AF
YF
0X

X525YF10AF0X525YF10AF0X525YF10AF0X525YF10AF0
0X
10
25
š›œžŸŸ¡¢£¤¥¦Ÿ§¨©ªŸ«¬¬­®ŸŸ¯œ›ž¦°±¦Ÿ²œ¦³̈±›´Ÿ§¨¢žµŸ¶Ÿ·ªŸ¡¦›¦³µ¦³¥µŸ¸¨žŸ¹ž¦³̧³¥³›´Ÿº±¦´³́»±¥Ÿ¼Ÿ¯›¦›Ÿ¡¥³±¥
13/06/2025 TE CSE-AIML SEM-V C-SCHEME SAIDS QP CODE: 10083339
  !"#
$%
&'()$&*+,- P[TK3狤狧
.'/)+-))0)1))213)
'/,,0)*-)0,
4'/)5,)671)0)66)*,),-
89:;; <==>?@=;<AB;CDEF; ;;GHIJ;
; K:;LMKN;OP;MQRSNMTPOP;NTPNOUV;W;>XRYKOU;NQRT;Z;KU[;NQRT;ZZ;T\\S\PW;; ;
; ]:;^_`abcdbecd_fghdbfi`jabafdak; ;
; l:;>XRYKOU;NMT;[OmmT\TUlT;]TNnTTU;oN\KNOmOT[;KU[;pYqPNT\;oKrRYOUV:; ;
; [:;>XRYKOU;sOUTK\;FTV\TPPOSU;KU[;ONP;<RRYOlKNOSUP:; ;
; T:;tTmOUT;PNKU[K\[;[TuOKNOSU;KU[;OUNT\vqK\NOYT;\KUVT;nONM;TXKrRYTP:; ;
8H:;K:;COU[;NMT;lS\\TYKNOSU;lSTmmOlOTUN;m\Sr;NMT;VOuTU;[KNK:;; G9IJ;
w5x)* y+))*)z w,-{
9; |; |I;
H; }; ~I;
; 9H; €|;
; 9|; }|;
|; 9}; ‚|;
~; HI; 9I|;
; ]:;LMKN;OP;pMOƒovqK\T;=TPNW;<;\TNKOY;lSrRKUQ;nKUNP;NS;[TNT\rOUT;Om;NMT\T;OP;K;;;G9IJ;
;

POVUOmOlKUN;KPPSlOKNOSU;]TNnTTU;lqPNSrT\;VTU[T\;KU[;R\TmT\TUlT;mS\;SUYOUT;
PMSRROUV;uP:;OUƒPNS\T;PMSRROUV:;=MT;lSrRKUQ;lSYYTlNT[;[KNK;m\Sr;K;\KU[Sr;
PKrRYT;Sm;HII;lqPNSrT\P„;KU[;NMT;\TPqYNP;K\T;PqrrK\OT[;OU;NMT;mSYYSnOUV;
lSUNOUVTUlQ;NK]YT:;EPT;NMT;pMOƒovqK\T;=TPN;mS\;ZU[TRTU[TUlT;NS;[TNT\rOUT;Om;NMT\T;
OP;K;PNKNOPNOlKYYQ;POVUOmOlKUN;KPPSlOKNOSU;]TNnTTU;VTU[T\;KU[;PMSRROUV;R\TmT\TUlT;KN;
K;|†;POVUOmOlKUlT;YTuTY;‡ˆ‰Š‹ŠŒ;
;

Ž)6); )w1)++ ,) )1)‘’w) “,;


2; w++2;
?KYT; ~I; I; 9II;
CTrKYT; €I; I; 9II;
“,; 9I; €I; HII;
8:;K:;;>XRYKOU;NMT;lSUlTRN;Sm;RƒuKYqT;OU;MQRSNMTPOP;NTPNOUV; ;;G9IJ;
; ]; <;PlMSSY;lSU[qlNT[;KU;KRNONq[T;NTPN;mS\;NM\TT;[OmmT\TUN;V\K[TP;”•\K[T;<„;•\K[T;–„;;;G9IJ;
KU[;•\K[T;p—:;=MT;PlS\TP;S]NKOUT[;]Q;NMT;PNq[TUNP;OU;TKlM;V\K[T;K\T;VOuTU:;<N;
K;‚|†;lSUmO[TUlT;YTuTY„;[TNT\rOUT;Om;NMT;PlS\TP;[OmmT\;POVUOmOlKUNYQ;Kl\SPP;NMT;
NM\TT;V\K[TP;qPOUV;NMT;˜\qP™KYƒLKYYOP;NTPN:;
;
;
;
01112 456789

½¾¿¾À¾Á¿ÁÂý¾¿¾À¾Á¿ÁÂý¾¿¾À¾Á¿ÁÂý¾¿¾À¾Á¿ÁÂÃ
\]^_`abacdef_ghaijk_lamnnopabaq_^]`hr_shat^hujs]vaijd`w_axaylach]huwhugwazj`a{`huzugu]va|sh_vvu}_sg_a~aq]h]acgu_sg_
  
  
  
  
  
  
  
!"!#$!%"&'()%&*+$!+!"+"#,!",-"+./!&+"0$!"12'!1!&"#,!-, ;<=


"+./!&+"%"3+$!!42!#+!/52,2.'+%,&61!&%"73&/+$!"+&//
/!8%+%,&%"9'#.'+!+$!):"#,!-,+$%""12'!1!&
 >!"!#$!#,&/.#+!/".8!(,-#,''!*!"+./!&+"+,/!+!1%&!$,?1&(;<=
$,."+$!("2!&/"+./(%&*2!?!!@9!+!-!A.!&#(/%"+%>.+%,&+>'!-,+$!
/+2,8%/!/
<3<33<333<B3<33<<3<3<7373<3<3<3<33<B3<3<<3<33<3
<733<3<3<3<B3<733<<3<33<3<3<3<3<373<B3<3<<3<33<3
<73<3<
2$1#!.+%#'#,12&($"/!8!',2!/&!?/.*+$++$!(#'%1',?!" ;<=
>',,/2!"".!1,!!--!#+%8!'(+$&+$!#. !&+"+&///.*0$!8!*!
!/.#+%,&%&>',,/2!"".!-,2+%!&+"."%&*+$!"+&///.*%"<11C*3?%+$
"+&///!8%+%,&,-11C*0$!#,12&(#,&/.#+"#'%&%#'+%'?%+$B
2+%!&+"."%&*+$!&!?/.*&/,>"!8!"&8!*!!/.#+%,&,-<11C*+
"%*&%-%#&#!'!8!'3&"?!+$!-,'',?%&*D
<EF++!+$!&.''&/'+!&+%8!$(2,+$!"!"
E9'#.'+!+$!+!"+"++%"+%#
G!+!1%&!%-+$!&!?/.*%""++%"+%#''("%*&%-%#&+'(1,!!--!#+%8!+$&+$!
"+&///.*

 >H%&/+$!"%12'!'%&!!*!""%,&!A.+%,&-,+$!*%8!&/+ ;<=
IJK LMNO
B <
7 <
 
< B
< 
< 
7P42'%&+$!#,&#!2+,-+?,:?(QRSC,?/,!"%+/%--!-,1,&!:?(;<=


QRSTG!"#%>!+$!"".12+%,&",-+?,:?(QRS&/$,?(,.?,.'/
#$!#@+$!"!"".12+%,&"'",3>%!-'(!42'%&H%!/1&U"+!"+"&,&:
21!+%#'+!&+%8!
 > V %+!"$,+&,+!",&5&(+?,6 ;<=
<9$%:"A.!/%"+%>.+%,&
V!%>.''/%"+%>.+%,&
BF+!1WX!-Y',+
Z,4Y',+

[[[[[[[[[[[[[[[

01112 4567898

€€‚€ƒƒ„ €€‚€ƒƒ„ €€‚€ƒƒ„ €€‚€ƒƒ„
klmnopqprstunvwpxyzn{p|}}~pqp€nmlown‚wpƒmw„y‚lpxyso†np‡pˆ{prwlw„†w„v†p‰yopŠow„‰„v„lp‹‚wn„Œn‚vnpp€lwlprv„n‚vn
22/11/2024 CSE-AIML SEM-V C SCHEME DLOC-STAT. FOR AIDS QP CODE: 10067675

7955


 8!"#!$%&'()#%*+!
,-../&'.!01+!.2*//!3*%&!.2/!*/&0"1"14!3"5/!6(/#."%1#!
7-##(&/!#(".08)/!90.0:!"3!*/6("*/9!019!#.0./!".!$)/0*)+!
!
8!-../&'.!01+!;!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! ! ! !!!! <!
0=>')0"1!'/*$/1.")/#!019!?%>')%.#!@".2!/>0&')/!
8A))(#.*0./!$/1.*0)!)"&".!.2/%*/&!@".2!0!1/0.!9"04*0&!
$=>')0"1!B())!019!0)./*10."5/!C+'%.2/#"#!@".2!/>0&')/!
9D20.!"#!C"#.%4*0&!E"5/!".#!0'')"$0."%1#!
/C%@!.%!9/./$.!%(.)"/*#F!
3D20.!"#!.2/!GHI/#.!(#/9!3%*F!
!
<! ! ! !! !!! ! ! ! ! ! ! 8!
0D20.!"#!J2"HK6(0*/!I/#.F!-!#.(9+!"#!$%19($./9!.%!/>0&"1/!.2/!*/)0."%1#2"'!8/.@//1!
4/19/*!019!@2/.2/*!0!'/*#%1!'*/3/*#!$%33//!%*!./0!I2/!90.0!$%))/$./9!"#!0#!3%))%@#L!
M7NO7P4777NQ7R9 77759S5T
5T7! 7U! U!;U!
V75T7! ,U! 7U!WU!
9S5T! WU! ;U!XU!
Y#/!$2"H#6(0*/!./#.!.%!3"19!0##%$"0."%1!8/.@//1!4/19/*!019!8/5/*04/!'*/3/*/1$/F!ZY#/!
[\]^]_`^!
!
8=>')0"1!B%*&0)!019!a%"##%1!b"#.*"8(."%1!c%#.!4*09(0./!#$2%%)#!%3!8(#"1/##!*/6("*/!
0'')"$01.#!3%*!09&"##"%1!.%!.0d/!.2/!E*09(0./!c0104/&/1.!-9&"##"%1!J%(1$")e#!
Ec-I!/>0&"10."%1!K$%*/#!%1!.2/!Ec-I!0*/!*%(42)+!1%*&0))+!9"#.*"8(./9!@".2!0!
&/01!%3!W,f!019!0!#.0190*9!9/5"0."%1!%3!,!I20.!"#!.2/!'*%808")".+!%3!01!"19"5"9(0)!
#$%*"14!08%5/!WUU!%1!.2/!Ec-IF!C%@!2"42!&(#.!01!"19"5"9(0)!#$%*/!%1!.2/!Ec-I!
"1!%*9/*!.%!#$%*/!"1!.2/!2"42/#.!WgF!!!!!! ! ! ! ! 8!
!
! ! ! ! !! ! ! ! ! ! ! 8!
0=>')0"1!#"14)/!019!c()."')/!)"1/0*!*/4*/##"%1!@".2!/>0&')/!019!#2%@!@".2!#(".08)/!
')%.!
!
8=>')0"1!.Hb"#.*"8(."%1!"1!9/.0")!I2/!J=h!%3!)"42.!8()8#!&01(30$.(*"14!$%&'01+!
$)0"&#!.20.!0!)"42.!8()8!)0#.#!7UU!90+#!-!*/#/0*$2/*!*019%&)+!#/)/$.#!W!8()8#!./#."14!
I2/!#0&')/9!8()8#!)0#.!01!05/*04/!.201!,XU!90+#F!9/5"0."%1!%3!WU!90+#!A3!.2/!J=he#!
$)0"&!@/*/!.*(/:!@20.!"#!.2/!'*%808")".+!.20.!W!*019%&)+!#/)/$./9!8()8#!@%()9!205/!
01!05/*04/!)"3/!%3!1%!&%*/!.201!,XU!90+#F!!!! ! ! ! 8!
!
i!! ! ! !!!!!!!!!!!! ! ! ! ! ! 8!
0=>')0"1!8*"/3)+!@2+!(#/!-Bhj-F!E"5/!9"33/*/1$/!8/.@//1!%1/H@0+!019!.@%H@0+!
-Bhj-!./#.!K%)5/!.2/!3%))%@"14!(#"14!%1/!@0+!011%50!
!
!
!
!
01012 456789

Ž‘’“’”“•Ž‘’“’”“•Ž‘’“’”“•Ž‘’“’”“•
EFGHIJKJLMNOHPQJRSTHUJVWWXYJKJZHGFIQ[H\QJ]GQ^S\F_JRSMI`HJaJbUJLQFQ^`Q^P`JcSIJdIQ^c^P^F_Je\QH__^fH\PHJgJZFQFJLP^H\PH

  


  
  
  
  
  
  
  
!"#$"%&'"()(($*!'!"%&'"()($)))"( "+!($",$-"'$)*./!"0
'$&1/&*2$&+('!"'&+ &3$.!"(3$"%&'"()(($*!'!"%
&'"()($)))%"(4 ,"+!($",$!"'$)*+)'1$0!*$"&3$ 567
8 ' 9 2$
 
 
 
 
 
 
 
 
 
 
:;2!"1/<")3$')!,#$'1( )$(!++$)$"'+)3)3$')!,3$'1(&567
8$)'3$"'+=2!,>$'1 "(&+$'?3"!')&'1$3$&2)$&'@$"',$"2()!"@!"0
/ '$)/$)$$++$,'!*$A)!13$'1"$&BA>#&C',2"'!$&()!"@!"0/ '$),3)$(
$+)$,$"2%/$$@'$)%"(/$$@& +'$),$"2&*$'1$+/!"02&!"0
'1$.)!$(3 "A$&'D

01012 456789

hijiklmlnmohijiklmlnmohijiklmlnmohijiklmlnmo
cdefghihjklmfnohpqrfshtuuvwhihxfedgoyfzoh{eo|qzd}hpqkg~fhh€shjodo|~o|n~hqgh‚go||n|d}hƒzof}}|„fznfh hxdodhjn|fznf


!"#$%&# '()(
*&#&&+!"%,$ -!."/01)(2
3456786968:7;
"<=>?@>=AB6ACB6D54E=EF>FAG6AC=A6H6>FBI6EBAJBBK6L6=KM6NLO6F;B;O6PQL8H8NLR;
"&+&%&0(22!%S0("
"&+&%,&0T22!%U0T" VWX
Y0Z& [WX
*+*+\\
],#^+'&.#+
_++,`ab#+,
Z
bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb

01012 4567898

†‡ˆ‡‰Š‹ŠŒ‹†‡ˆ‡‰Š‹ŠŒ‹†‡ˆ‡‰Š‹ŠŒ‹†‡ˆ‡‰Š‹ŠŒ‹
F
BE

8B

4C
E
F5
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

43
C5

FE
92

A7
5A
BE
E2

8B

C
3E
1T01865 - T.E. Computer Science & Enginering (Data Science) (Choice Based) (R-2019-20'C' Scheme) SEMESTER - V / 48885 -

EF

74
C2

C5

92

A4
Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

EA
BF
BE
01

E2

5
QP CODE: 10014523 DATE: 02/12/2022

F
8

3
[[Time: 3 Hours]

F7
[ Marks:80]

FE
92

4
C

2C

5A
F0

E
01

B
B
2E

EF
8
C1
N.B. 1. Question No. 1 is compulsory.

F7

C5

2
1C

A
BF
4C
2. Attempt any three questions out of remaining five.

BE
E2

F5
F

70

28
C1
A7
3. All questions carry equal marks

C5

FE
0F

1C

E9
C
3E
4. Assume Suitable data, if required and state it clearly.

E2

8B
1F
4

5B
7
A4

F7

92
CC
EA

2C
F5

F0

BE
1
74
3

70

2E

28
FE

C1
4

C5
A
A
Q.1 Attempt any four: 20

0F

1C

E9
8B

C
E
F5

E2
a) Find the standard deviation of the average temperatures recorded over a

F
74
43

70

5B
E

C1
92

C2
A
A
five-day period last winter: 19, 21, 18, 24, 12?

2C
BE

8B

C
E
5

F0

01
F
b) X is a normally distributed variable with mean μ = 30 and standard deviation

74
3

2E
C5

FE

1
2

F7
C
9

EA
A
σ = 4. Find:

1C
BE
E2

4C
5

F0

E2
F
i) P (x < 40), ii) P (30 < x < 35)
35)?

70
C2

C1
92

A7
A4

C2
C

BF
c) Discuss Boot strapping vs. rere-sampling

0F
E
01

E2

4C
E
F5

01
B

1F
d) The
he school principal wants to test if it is true what teachers say – that high

43
F7

C5

FE
2

A7

F7
C

CC
E9

5A
school juniors use the computer an average 3.2 hours a day. What are our
F0

8B

3E

F0
0

5B
E

EF
null and alternative hypotheses?

74
C1

F7

C1
2

A4
1C

2C

EA
BF
e) What do you mean by correlation and regression? Explain with example
4C

F0

BE

4C
F5
0

2E

28

43
C1
A7

C5

FE

A7
0F

E9

5A
4C
3E

E2

8B

3E
1F

5B

EF
Q.2 a) Find
ind the value of the correlation coefficient from the data given in the 10
A7
A4

F7

92

A4
C

2C

BF
following table:
C
3E
F5

BE
01

F5
F
4

2E

28
FE

C1
A7
A4

C5

FE
F

E9
8B

SUBJECT AGE (X) GLUCOSE LEVEL(Y)


3E
F5

E2

8B
1F
4

70

5B
FE
2

A7
A4

1 43 99
C2

92
C
E9

2C
8B

4C
E
F5

2 21 65

BE
01
5B

1F
3

2E
FE
2

A7
4

F7

3 25 79

C5
2C

CC
E9

1C
8B

3E
F5

F0

4 42 E2 75
5B
2E

74

70
FE

C1
2

A4

5 57 87
C2
1C

2C

E9

EA

0F
8B

4C
F5

01

6 59 81
70

5B
2E

1F
3
FE
2

A7
4

F7
0F

1C

CC
E9

5A
E2

8B

3E

F0
70

b)
5B

10
EF

74
2

C1
2

A4
F

1C

Explain briefly why ANOVA is used? Solve using One


One-way
way ANOVA
2C

E9

EA
BF
F0

4C
F5
70

5B
2E

28

43
C1

FE

A7
F

1C

2C

E9

5A
4C

F0

8B

3E
70

5B
2E

EF
C1
A7

92

A4
F

1C

2C

BF
4C

F0

F5
70

B
2E

28
C1
7

C5

FE
EA

0F

1C

E9
4C

E2

8B
1F
43

70

5B
7

C2

92
CC
EA
A

0F

2C
F5

BE
01
1F
74
43

2E
F7

C5
CC
EA
5A

C
F0

E2
EF

74
43

70
C1

C2
EA
5A
BF

0F

method:
4C

01
F

1F
28

43
FE

F7
CC
E9

EA
5A
8B

F0
F

74
43
FE

C1
92

EA
A
BE

8B

4C
F5

43
C5

FE
92

A7

14523 Page 1 of 3
A
BE
E2

8B

3E
F5
C2

C5

FE
92

A4
BE
E2

8B

F5
C2

C5

F701C2E2C5BE928BFEF5A43EA74CC1F0
FE
92
F
BE

8B

4C
E
F5
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

43
C5

FE
92

A7
5A
QP CODE: 10014523

BE
E2

8B

C
3E
EF

74
C2

C5

92

A4

EA
BF
BE
01

E2

F 5
8

3
Q.3 a) Explain type I & type 2 error in detail.

F7
10

FE
92

4
C

2C

5A
F0
(ii) What is the use of scatter plot and box plot?

E
01

B
B
2E

EF
8
C1
b) In
n a manufacturing unit, four teams of operators were randomly selected and 10

F7

C5

2
1C

A
BF
4C

0
sent to four different facilities for machining techniques training. After the

BE
E2

F5
F

70

28
C1
A7
training, the supervisor conducted the exam and recorded the test scores. At

C5

FE
0F

1C

E9
C
3E
95% confidence level does the scores are same in all four facilities?

E2

8B
1F
4

5B
7
A4

F7
(Hint: Use Kruskal–Wallis
Wallis test)

92
CC
EA

2C
F5

F0

BE
1
74
3

70

2E

28
FE

C1
4

C5
A
A

0F

1C

E9
8B

C
E
F5

E2
F
74
43

70

5B
E

C1
92

C2
A
A
F

2C
BE

8B

C
E
5

F0

01
F

74
3

2E
C5

FE

1
2

F7
C
9

EA
A

1C
BE
E2

4C
5

F0

E2
F
8

70
C2

C1
92

A7
A4

C2
C

BF

0F
E
01

E2

4C
Q.4 a) Iff the sample mean and expected mean value of th thee marks obtained by 15 10

E
F5

01
B

1F
8

43
F7

C5

FE
2

A7
students in a class test is 290 and 300 respectively. What is the tt-score
score if the

F7
C

CC
E9

5A
F0

8B
standard deviation of the marks is 50?
5

3E

F0
0

5B
E

EF

74
C1

F7

C1
2

A4
b) Find
ind out what is the relation between the GPA of a class of students and the 10
1C

2C

EA
BF
4C

F0

BE

4C
number of hours of study and the height of the student

F5
0

2E

28

43
C1
A7

C5

FE

A7
0F

E9

5A
4C
3E

E2

8B

3E
1F

5B

EF
A7
A4

F7

92

A4
C

2C

BF
C
3E
F5

BE
01

F5
F
4

2E

28
FE

C1
A7
A4

C5

FE
F

E9
8B

C
3E
F5

E2

8B
1F
4

70

5B
FE
2

A7
A4

C2

92
C
E9

2C
8B

4C
E
F5

BE
01
5B

1F
3

2E
FE
2

A7
4

F7

C5
2C

CC
E9

1C
8B

3E
F5

F0

E2
5B
2E

74

70
FE

C1
2

A4

C2
1C

2C

E9

EA

0F
8B

4C
F5

01
70

5B
2E

1F
3
FE
2

A7
4

F7
0F

1C

CC
E9

5A
E2

8B

3E

F0
70

5B

EF

74
2

C1
2

A4
F

1C

2C

E9

EA
BF
F0

Q.5 a) A farmer is trying out a planting technique that he hopes will increase the 10
4C
F5
70

5B
2E

28

43
C1

yield on his pea plants. The average number of pods on one of his pea plants
FE

A7
F

1C

2C

E9

5A
4C

F0

is 145 pods with a standard deviation of 100 pods. This year, after trying his
8B

3E
70

5B
2E

EF
C1
A7

new planting technique, he takes a random sample of his plants and finds the
92

A4
F

1C

2C

BF
4C

F0

average number of pods to be 147. He wonders whether this is a statistically


F5
70

B
2E

28
C1
7

significant increase. What are his hypotheses and the test statistic? Use a
C5

FE
EA

0F

1C

E9
4C

0.055 significance level.


E2

8B
1F
43

70

5B
7

C2

b) Find
ind the simple linear regression equation that fits the given data and 10
92
CC
EA
A

0F

2C
F5

BE
01

coefficient of determination:
1F
74
43

2E
F7

C5

Hour Temp
CC
EA
5A

C
F0

E2

2 21
EF

74
43

70
C1

C2
EA
5A

4 27
BF

0F
4C

01

6 29
F

1F
28

43
FE

F7
CC
E9

EA
5A

8 86
8B

F0
F

74
43

10 86
FE

C1
92

EA
A

12 92
BE

8B

4C
F5

43
C5

FE
92

A7

14523 Page 2 of 3
A
BE
E2

8B

3E
F5
C2

C5

FE
92

A4
BE
E2

8B

F5
C2

C5

F701C2E2C5BE928BFEF5A43EA74CC1F0
FE
92
C2 E9 F5 A7 0F
E2 28 A43
4C 70
1C
C5 BF C1
BE EF EA F0 2E
C2
E2 92 5A 7 4C F 70 2C
C5 8B 43 C1 1C 5B
FE EA F0 2E E9
BE
92 F 5A 7 F 2C 2 8B
4C 70 5B FE
C5 8B 43 C1 1C
BE FE EA F0 2 E9
2
F5
92 F 5A 74 F E2
C 8B A4
Q.6

8B CC 70 5B FE 3E
43 1C A7

14523
FE EA 1F 2E E9 F5
a)

b)
92 F5 74 0F 2C 2 8B A4 4C
8B A 43 CC 70
1C 5B FE 3E C1
F0
FE EA 1F E9 F5 A7 F7
2E 2 4C
F5 7 0F 2C 8B A4 01
A 4C 70 3E C1
years if

43 1C 5B C2 FE F0
C1 E9 E2 F5 A7
FE EA F0 2E 4 F7
F5 A 4 C 2
0 1
C5
74 F7 2C 8B 3 C C BE
A4
3E
CC 01 5B FE E A7
1F 2 E2 92
1F C2 E9 F5 0F
A7 0F E2 28 A4 4 C 7 0 C 5
8B
FE
iii. Standard Error
4C 70 C5 BF 3E C1 1C BE
1 C B F 0 2 E 9 2
F5
C1 E EF A7 2 A4
i. Confidence Interval

F0 2E 92 5A 4C F7 8B
F7 2C 8B 4 C 0 1
C5
B F E
3E
3 C
ii. Central Limit Theorem

01 5B FE EA 1F 2E E F5 A7
Write short notes on (any two)

C2 E9 F5 74 0 F 2C
92
8 A 4
4C
E2 28 7 B 3 C1
i. All five people are still living.

A4 CC 01 5B FE EA
C5 BF 3E 1F C 2 E9 F 7
F0
BE EF A7 0 E2 2 5A 4C
F7
F 8 01
ii. At least three people are still living.

92 5A 4C 70 C5 BF 43 C1
8B 43 1 C E E A F C2
0

Page 3 of 3
C1 BE F E2
FE EA F0 2E 9 2 5A
74 F7 C5
F5 74 F7 2C 8B 43
CC 01
C BE
A4 CC 01 5B FE E A
1F 2
õõõõõõõõ-

3E 1F C2 E9 F5 74 0F E2
A7 0F E2 28 A4 C 70 C5

F701C2E2C5BE928BFEF5A43EA74CC1F0
4C 70
1C
C5 BF 3 EA
C1
F0
1C
2E
BE
C1 BE EF 92
F0 2E 92 5A 74 F7 2C 8B
F7 2C 8B 43 C C1 0 1C 5 BE F
01 5B FE E A7 F 2 E2 92
C2 E9 F5 0F 8B
E2 28 4C 70 C5
iii. Exactly two people are still living. (Hint: Binomial Distribution)

A4 1 B
C5 BF 3E C 1 C E
FE
BE EF A7 F0 2 E2 92 F5
92 5A 4C F 70 C5 8 BF
According to recent data, the probability of a person living in these

8B
conditions for 30 years or more is 2/3. Calculate the probability that after 30

43 C1 1C BE EF
FE EA F0 2E 92 5A
F5 8
10
An agent sells life insurance policies to five equally aged, healthy people. 10

74 F7 2C B 43
A4 CC 01 5B FE E
3E 1F C2 E9 F 5
A7 0F E2 28
QP CODE: 10014523

A4
4C 70 C5 BF 3E
C1 1C BE EF A7
F0 2E 92 5A 4C
F7
01
2C 8B 4 3
5B FE EA
Paper / Subject Code: 48895 / Department Optional Course - 1: Statistics for Artificial Intelligence & Data Science

C2 E9 F5 74
E2 28 A C

You might also like