0% found this document useful (0 votes)

55 views3 pages

Set 4 IBM-322

The document contains 5 questions about statistical concepts such as Chebyshev's inequality, linear regression, logistic regression, confidence intervals, probability distributions, and expectation and variance. The questions involve applying these statistical concepts to data scenarios and calculating related probabilities, coefficients, and intervals.

Uploaded by

Ayush Kushwah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views3 pages

Set 4 IBM-322

Uploaded by

Ayush Kushwah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

\_y

Xu*., Roll number:

Q,l [10 marks] A one-dimensional dataset of 1 lakh points has the mean as 100 and yaruance
as 9. Chebyshev's inequality states that atleast points are contained in this
interval*182, 1181
h=6, 1" LL- *-goTnU
Q.2 I10 marksl We fit a linear regression model on some dataset. One of the independent
variables is Temperature, measured in Kelvin (K). The coefficient of this variable
comes out
to be 2000. If the temperature was measured in Fahrenheit (o F), the coefficient of
this variable
wouldhave come outto be t I I I " I ?

uF
[Hint: -915 (K -273)+32]

Q.3 [10 marks] A salesperson has visited 1000 customers to sell a book in a city. He has the
data of several attributes of these customers (Income, education level, interest
in ieading, etc.).
Finally, he has also maintained the record of who purchased the book or not. He uses
iogistic
regression model to buitd a model for this data. Assume that the cost
of the book is Rs.i00,
selling price of the book is Rs. 3000, average cost of visiting a customer is Rs. 200.
The salesperson gets a new list of 10000 customers and the attributes used in building
the
model' For a customer, the probability of purchasing the book comes out as ,.p,,.
The
salesperson decides to visit a customer only if the e*pe.ted value of visiting
the cusiomer is
more than Rs. 800. For what values of "p" should the salesperson visit a customer.
z3oo1 tt *t) 2o o. s oo
- )
Q.4 t10 marksl In the last offering, IBM-3r, ,o,$3 r?#o with 200 srudents. 160 srudents
gave the ETE, others have dropped the course. If we were to give
an estimate of the proportion
of students who drop IBM-322 course, and the class .un b. considered u, u ,.pi.sentatire
sample for the same, what is the unbiased estimate for the proportion
of studenis who drop
IBM-322 course. Construct a95o/o confidence interval for the proportion of students
who droo
rB*-322course'
Q= 0,L CT* p*at_,ry g (. irrq ;.;d)
Q'5 t5 marksl Let W be a random variable that follows chi-square distribution with degrees
of freedom: 15. what is the expectation of w? Derive the result. v
6tvc), 1g-
Q'6 [5 marks] True/False (no expl.anation required): A box plot is always a better way to
describe the data as compared to a histogram.
filt*
Q.7 [10 marks] Consider a 3-sided dice with letters A,B,C written on the 3 faces. We are
interested in estimating the probability of these letters appearing
when the dice is rolled. The
dice is rolled l0 times. The observed outcome is A,A,B,A,B,A,d,A,A,B
What is the maximum likelihood estimate of probability of ihese'letters
appearing? you may
use the notations pa, pe and pc for these probabilities.

$tuav and briefly show all the steps, just writing the final answer is not sufficient.
fDon't worry auoultlel.Tl#rytricgl shape Jf u g-rid.d dice, assume such a dice exists]
Ylt-6"*- PAt. f*. f.
btrEL$
E-u.,
F. A !

ry*&j3ry-€
fg
3 &(._
"r*ffi

flA=#; ?s* -'2 i Vr- J

lB '/
5
le
Name :

Q. 8- [5 +10 marks] There is a six sided fair dice (with numb.i"ittr?t"T' th. six faces). Let -
ECX') f
I

an experiment be rolling the dice 100 times. V {r-\.= ZEty)

a 5c + b L\tiu/:) .

Denote by
rrxil arafidom variable which counts the number of times ah evb,ffnumber appears
in these 100 attemPts. EC=) lSo -
,,Y", a random variable which counts the number of times a number 1::t,l}^:fl"d
Denote by
e CnvtK'U
totwocomesinthese 100attempts. Va,rrC")= rlahtn)+ 1.voJ.t\) t
Let Z * X+3Y
.Ia.r.Cr)= eg -f q"Y + (or(rc,9
CrrC'n,f ) - ECXv)- Ec).) gc*r) ,=- 6
Find E(Z) and Variance(Z)
. Var.C-)= {S-+{ooto =-2Q{
* an event to raise awareness about mental health.
e.9 [10 + 10 marks] SAC wants to organize
pirt rrut.ly, SAC hasteen able to obtain some funding to provide unlimited Kaju-Katli(Indian
;*;;0 i; tire students who come to the event. SAC now needs to get an idea about how many
Kaju-Katli will be consumed during the event.
past exp.erience,
Assume there are 10000 students who have the possibility of coming. From
the student will come
we know that if we randomly pick a student, there ts a So/a chance that
to the event. So, the number ofstudents who come to the event can be modelled
by a Binomial
distribution. Etf):$ooX3:fgoo r 4
Var tr)= Ecr.r ).Va.r" (x) -+- [+^): .Yartni/
Also, model the number of Kaju-Katli i'stuae"t *ili'#nJ.rliv a-polsson d(tribution
r'vith

mean:3. Val.tf): FOO XZ + ?LXA$€

){*HP*ffiffi*S"rs the total number of Kaju-Katlis that will
Let T denote the random
consumed during the event.

Find the EXPECTATION and VARIANCE of random variable T'

is thought that avanable Y is dependent on

e.lg - t15 marksl Consider a scenario where it
a variable X.

Usually, the hypothesised linear regression model is -

E(Y):alpha+beta*X
wants to propose a new
Mr. Kamal Mohan from IIT Delhi believes that life is not constant and
relationship between y and X without the constant term and with the
X2 term. Mr. Mohan also
has strange reasons to believe that the coefficient.of X*2 would be
half of coefficient of X
.The hypothesised relationship as per Mr. Mohan is
E(Y) : beta*X + (betalz)*xz
is the best estimate
Using the same criteria of minimi zingthe sum of lQuare of residuals, what
as xi and yi ]
of beta? [Assume that there are 'on" fioints in the data, use the notations for them
&[x'] *+$] ffi
{>
4
p ,.€- tf
?l

A {xY+* Y;)^U*
'i 2l
Roll number:
Name:
be random variables X and Y distributed indePendentlY and
Q.11 [10+5+5 marks] Let there
having the following distribution :

X * Normal(mean : Z},vatiance:25)
:
Htq
Y - Normai(mean : 100,variance 36)
LetUandVberandomvariableswhicharegivenasfollows:

u:Y +2x ECU)= [10

ECV) = 92-
,,_{X, probabtltty -A.6
u: 6-[u)=,"6(
Y,
probabiti.ty 0.4
t - 12-- /O S
and Y with probability 0'4
In simple terms, v will be X, with probability 0'6

Find the Expectation of the random variables u and v. Arso, find the standard deviation of the
random variable U-

of v' only the shape needs to be

plot rough sketch of the PDF (probability density function) y-axis.
a
correct. Don,t worry about the exact values on
the X-axis and [simulation may help
herel

model was built to understand if comrption level

(cL) can be
e.12 [5+5 marks] A regression
explained using rlr capita Income(PCI) and
Litglacv Rate(LR) of a country' The model was
was -
inierestingly a very good fit to the data' The model

E(CL) = 100 - 0.0005 * PCI - l'2* LR

You can assume that there are no multlcollinearity issuesffX'm$tpmrr b ?4#-*
a.) Give the interpretation of the coefficient of PCl.
;.i ffiffiH#;r.,iu";, ,o ,.0,,e the corupti#n*fl&.11flv?if,ffireasing.
,----^L r t>
-^^J- +^
to by how much LR needs to
the

Literacy Rate. Can the model provide an exact answer r{

,.rri*.ih. targetiin ttte expected sense). If yes, what is the answer' If
- ^-^^_-.^-
;;; i"
no, why not? ,Its'

distributed' We draw a sample of 6

e.13 [10 marksl Assume that the populalign-t: i::rylly
Construct a symmetnc 99a/o confidence
observations. They turn out to be - 12,,4,6,8,!0,12\'
i;;ri'fb. it . poprtation mean. F= + tr= !-fu +- q'03

professor Sumit Nagar bertt]rln"j very few sludents (less than 20%) think
e.14 [10 marks] hypothesis]'
tilrt going to the gy*i5 a waste of tirie [Put this in the alternative
gets the data.of 500 students with 90
To test this, professor Nagar,decided to coflect data. I-Ie
expressing that going to gym is a waste of time.
whai is the p-varue associated with this
hypothesistest.
r/b-ld_e_=o.tzlo .

Statistical Analysis Exam Questions
No ratings yet
Statistical Analysis Exam Questions
3 pages
Set 1 IBM-322
No ratings yet
Set 1 IBM-322
3 pages
Set 3 IBM-322
No ratings yet
Set 3 IBM-322
3 pages
IBM322 MTE 22 Feb
No ratings yet
IBM322 MTE 22 Feb
4 pages
ECO 201 Papers With Solutions (23 Batch)
No ratings yet
ECO 201 Papers With Solutions (23 Batch)
21 pages
PNS Compre
No ratings yet
PNS Compre
3 pages
2015 - 2016 Introduction To Statistics and Probability
No ratings yet
2015 - 2016 Introduction To Statistics and Probability
6 pages
Statistics Paper
No ratings yet
Statistics Paper
12 pages
MAKAUT Question Paper GIVEN BY KKS
No ratings yet
MAKAUT Question Paper GIVEN BY KKS
4 pages
M.SC - Statistics or - 2020
No ratings yet
M.SC - Statistics or - 2020
12 pages
340 s23 Final
No ratings yet
340 s23 Final
7 pages
UCS410
No ratings yet
UCS410
2 pages
Ssmda Pyq
No ratings yet
Ssmda Pyq
16 pages
B.A. H Economics Intermedi Bikup2y2023
No ratings yet
B.A. H Economics Intermedi Bikup2y2023
32 pages
M.SC - Statistics - 2013
No ratings yet
M.SC - Statistics - 2013
12 pages
Probability and Statistics1
No ratings yet
Probability and Statistics1
52 pages
Statistics Questation Paper For H S Final Examination 2022
No ratings yet
Statistics Questation Paper For H S Final Examination 2022
12 pages
Probability and Statistics
No ratings yet
Probability and Statistics
52 pages
MLESA v2024 Week10 Assignment Solution
No ratings yet
MLESA v2024 Week10 Assignment Solution
7 pages
R07 Set No. 2: 5. (A) If A Poisson Distribution Is Such That P (X 1) - P (X 3) - Find
No ratings yet
R07 Set No. 2: 5. (A) If A Poisson Distribution Is Such That P (X 1) - P (X 3) - Find
8 pages
Mod I-II - III - Study Material BL 4 - 5 - 6
No ratings yet
Mod I-II - III - Study Material BL 4 - 5 - 6
7 pages
Actl 20025101 Finalexamsolutions 2006
No ratings yet
Actl 20025101 Finalexamsolutions 2006
15 pages
MSC - Statistics - 2014
No ratings yet
MSC - Statistics - 2014
12 pages
M.SC - Statistics - 2021
No ratings yet
M.SC - Statistics - 2021
13 pages
Solutions To Final1
No ratings yet
Solutions To Final1
12 pages
MDU University BTech Mathematics PyQs
No ratings yet
MDU University BTech Mathematics PyQs
29 pages
P&s Model Paper
No ratings yet
P&s Model Paper
4 pages
ST104b - Statistics 2 - 2014 Exam - Zone-A
No ratings yet
ST104b - Statistics 2 - 2014 Exam - Zone-A
30 pages
MAT2337 December 2010 Final Exam
No ratings yet
MAT2337 December 2010 Final Exam
11 pages
AI HL Revision Worksheet-Statistics and Probability
No ratings yet
AI HL Revision Worksheet-Statistics and Probability
30 pages
13 Jan S1 Red
No ratings yet
13 Jan S1 Red
9 pages
MA 4114 - Probability & Statistical Method Question Paper
100% (1)
MA 4114 - Probability & Statistical Method Question Paper
6 pages
Important Instructions To The Candidates:: Part B
No ratings yet
Important Instructions To The Candidates:: Part B
7 pages
Question 1
No ratings yet
Question 1
23 pages
2011 CAPE Applied Math P1
100% (3)
2011 CAPE Applied Math P1
9 pages
Example Questions For Final
No ratings yet
Example Questions For Final
9 pages
Statistics & Econometrics Problems
No ratings yet
Statistics & Econometrics Problems
4 pages
M.SC - Statistics - 2010
No ratings yet
M.SC - Statistics - 2010
13 pages
Solutions RVCE AIML Test 2
No ratings yet
Solutions RVCE AIML Test 2
5 pages
STA 4322 Exam 1 Soln
No ratings yet
STA 4322 Exam 1 Soln
6 pages
3k Kertaus Stat B Markscheme
No ratings yet
3k Kertaus Stat B Markscheme
30 pages
02 02 2013 Statistical Compiler ST13
No ratings yet
02 02 2013 Statistical Compiler ST13
103 pages
University of Hyderabad PH.D - Statistics - 2012
No ratings yet
University of Hyderabad PH.D - Statistics - 2012
8 pages
Statistical Inference (18BS2T59) - End-Term Exam - 2020-2021
No ratings yet
Statistical Inference (18BS2T59) - End-Term Exam - 2020-2021
3 pages
Exam Solution 3
No ratings yet
Exam Solution 3
6 pages
January 2016B
No ratings yet
January 2016B
7 pages
Rr311801 Probability and Statistics
No ratings yet
Rr311801 Probability and Statistics
8 pages
B.A. P Basic Statistics For Econ 3gp3l47
No ratings yet
B.A. P Basic Statistics For Econ 3gp3l47
16 pages
Regular
No ratings yet
Regular
8 pages
Probability and Statistics-2023
No ratings yet
Probability and Statistics-2023
4 pages
Statistics Question Bank
No ratings yet
Statistics Question Bank
4 pages
MATH 376 - Final Exam Sample Solutions: 1 2 M 1 2 N I 1 2 1 I 2 2 2
No ratings yet
MATH 376 - Final Exam Sample Solutions: 1 2 M 1 2 N I 1 2 1 I 2 2 2
8 pages
2019 Statistic
No ratings yet
2019 Statistic
27 pages
Be - Artificial Intelligence and Data Science - Semester 4 - 2024 - May - Statistics Pattern 2019
No ratings yet
Be - Artificial Intelligence and Data Science - Semester 4 - 2024 - May - Statistics Pattern 2019
4 pages
MAKAUT Math 2023
No ratings yet
MAKAUT Math 2023
2 pages
Btech Cse 3 Sem Mathematics 2011
No ratings yet
Btech Cse 3 Sem Mathematics 2011
8 pages
The Central Limit Theorem and Hypothesis Testing Final
100% (1)
The Central Limit Theorem and Hypothesis Testing Final
29 pages
Module A
No ratings yet
Module A
43 pages
Methods of Psychology
No ratings yet
Methods of Psychology
35 pages
Nassim Taleb Risk Book
100% (3)
Nassim Taleb Risk Book
99 pages
Interpersonal Intelligence Study
No ratings yet
Interpersonal Intelligence Study
10 pages
Statistics Module-1 3rd-22-23 Assessment
No ratings yet
Statistics Module-1 3rd-22-23 Assessment
1 page
Frekuensi Distribusi Skripsi Amel
No ratings yet
Frekuensi Distribusi Skripsi Amel
5 pages
Probability and Statistics Problems
No ratings yet
Probability and Statistics Problems
14 pages
Sampling & Distributions Assignment
No ratings yet
Sampling & Distributions Assignment
5 pages
EQI Gappy ch5 20240528
No ratings yet
EQI Gappy ch5 20240528
25 pages
FACULTYENGAGEMENTRELTEST
No ratings yet
FACULTYENGAGEMENTRELTEST
38 pages
Study on Brand Equity of Pondicherry Spinning Mills
No ratings yet
Study on Brand Equity of Pondicherry Spinning Mills
48 pages
Cambridge Assessment International Education
No ratings yet
Cambridge Assessment International Education
24 pages
Business Statistics Practice Test
100% (1)
Business Statistics Practice Test
14 pages
Solving Binomial Distribution Using MS Excel 1
No ratings yet
Solving Binomial Distribution Using MS Excel 1
16 pages
Sebenta - Empirical Methods For Finance - Vasco Tamen Master's Course
No ratings yet
Sebenta - Empirical Methods For Finance - Vasco Tamen Master's Course
34 pages
How To Specify Estimate and Validate Higher-Order Constructs
No ratings yet
How To Specify Estimate and Validate Higher-Order Constructs
15 pages
Tree and Forest Measurement Book
No ratings yet
Tree and Forest Measurement Book
190 pages
Principles of Statistical Inference
100% (10)
Principles of Statistical Inference
236 pages
Soderstrom T., Stoica P. System Identification (PH 1989) (ISBN S
100% (6)
Soderstrom T., Stoica P. System Identification (PH 1989) (ISBN S
637 pages
Probd
No ratings yet
Probd
49 pages
Business Statistics Consolidated Assignment-2 - 10th February 22
No ratings yet
Business Statistics Consolidated Assignment-2 - 10th February 22
10 pages
Maths 2A Senior
50% (4)
Maths 2A Senior
14 pages
4.91 Master of Management Systme MMS Sem I and II
No ratings yet
4.91 Master of Management Systme MMS Sem I and II
51 pages
Course Outline - Business Statistics
No ratings yet
Course Outline - Business Statistics
7 pages
Data Science Cheat Sheet
No ratings yet
Data Science Cheat Sheet
7 pages
Homework Assignment 2
No ratings yet
Homework Assignment 2
2 pages
Understanding Standard Deviation & Variance
No ratings yet
Understanding Standard Deviation & Variance
4 pages
Ugc Net Economics Paper II Solved D0110
No ratings yet
Ugc Net Economics Paper II Solved D0110
8 pages
Chapter 10 Powerpoint
No ratings yet
Chapter 10 Powerpoint
47 pages

Set 4 IBM-322

Uploaded by

Set 4 IBM-322

Uploaded by

\_y

Xu*., Roll number:

flA=#; ?s* -'*2 i Vr-* *J*

an experiment be rolling the dice 100 times. V {r-\.= ZEty)

mean:3. Val.tf): FOO XZ + ?LXA$€

Find the EXPECTATION and VARIANCE of random variable T'

is thought that avanable Y is dependent on

Usually, the hypothesised linear regression model is -

u:Y +2x ECU)= [10

of v' only the shape needs to be

model was built to understand if comrption level

E(CL) = 100 - 0.0005 * PCI - l'2* LR

Literacy Rate. Can the model provide an exact answer r{

distributed' We draw a sample of 6

You might also like

flA=#; ?s* -'2 i Vr- J