Rahul Jha (210802) January 21, 2025 1
1 1 Marks
1. For any two events A and B, P (A ∪ B) = P (A) + P (B) - P (A ∩ B).
A: We know from the axioms of probability that for any two disjoint events A, B we
have P(A ∪ B) = P(A) + P(B). Now,

(1) A = (A ∩ B) ∪ (A ∩ B^c)
(2) P(A) = P(A ∩ B) + P(A ∩ B^c)
(3) A ∪ B = ((A ∪ B) ∩ B) ∪ ((A ∪ B) ∩ B^c) = B ∪ (A ∩ B^c)
(4) P(A ∪ B) = P(B) + P(A ∩ B^c)
(5) P(A ∪ B) = P(B) + P(A) − P(A ∩ B)

Here equation (5) follows from equations (2) and (4). Note that equations (1) and (3) each break a set into
a union of two disjoint sets, so the additivity rule for disjoint events applies. □
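As a quick sanity check (separate from the proof), inclusion-exclusion can be verified exhaustively on a small uniform sample space; the sets A and B below are arbitrary choices:

```python
# Exhaustive check of inclusion-exclusion on a small uniform sample space.
from fractions import Fraction

omega = set(range(12))
A = {0, 1, 2, 3, 4}
B = {3, 4, 5, 6}

def prob(event):
    """Probability of an event under the uniform measure on omega."""
    return Fraction(len(event), len(omega))

lhs = prob(A | B)
rhs = prob(A) + prob(B) - prob(A & B)
```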
2. For any two random variables X, Y , E[X + Y ] = E[X] + E[Y ].
A: I use the following definition of discrete expectation: E[X] = Σ_x x P(X = x), where the
summation is over the image of the sample space under X. WLOG, we can assume that
both X, Y are defined on the same sample space. Also, Σ_x P(X = x) = 1, and for a joint
distribution, Σ_y P(X = x, Y = y) = P(X = x) (and symmetrically for Y). Now,

E[X + Y] = Σ_y Σ_x (x + y) P(X = x, Y = y)
E[X + Y] = Σ_y Σ_x x P(X = x, Y = y) + Σ_y Σ_x y P(X = x, Y = y)
E[X + Y] = Σ_x x Σ_y P(X = x, Y = y) + Σ_y y Σ_x P(X = x, Y = y)
E[X + Y] = Σ_x x P(X = x) + Σ_y y P(Y = y)
E[X + Y] = E[X] + E[Y]

Note that independence of X and Y is never used. □
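The fact that no independence is needed can be checked exactly on a small, deliberately dependent joint pmf (the probabilities below are illustrative):

```python
# Check E[X+Y] = E[X] + E[Y] on a joint pmf where X and Y are dependent.
from fractions import Fraction

# Joint pmf P(X=x, Y=y); the probabilities sum to 1 and do not factorize.
joint = {
    (0, 0): Fraction(1, 2),
    (1, 1): Fraction(1, 4),
    (1, 2): Fraction(1, 4),
}

E_sum = sum(p * (x + y) for (x, y), p in joint.items())
E_X = sum(p * x for (x, y), p in joint.items())
E_Y = sum(p * y for (x, y), p in joint.items())
```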
3. Var(cX) = c²Var(X) where c is a constant and X is a random variable.
A: Using the definition of variance, for a random variable X, Var(X) = E[(X − E[X])²].
Also, by linearity of expectation, we have E[cX] = cE[X]. Now,

Var(cX) = E[(cX − E[cX])²]
Var(cX) = E[(cX − cE[X])²] = E[c²(X − E[X])²]
Var(cX) = c²E[(X − E[X])²]
Var(cX) = c²Var(X) □
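A quick exact check of the identity on an arbitrary small pmf (the values and the constant c are illustrative):

```python
# Numeric check of Var(cX) = c^2 Var(X) on a small pmf.
from fractions import Fraction

pmf = {-1: Fraction(1, 4), 0: Fraction(1, 4), 2: Fraction(1, 2)}
c = 3

def var(pmf):
    """Variance of a finitely supported pmf, computed exactly."""
    mean = sum(p * x for x, p in pmf.items())
    return sum(p * (x - mean) ** 2 for x, p in pmf.items())

# The pmf of cX: every outcome is scaled by c, probabilities unchanged.
scaled = {c * x: p for x, p in pmf.items()}
```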
4. Prove that with probability at least 1 − 1/k, a uniformly random permutation σ : [n] → [n]
has at most k fixed points.
A: Consider the event that a permutation has at least k + 1 fixed points. Its probability
can be bounded as follows.

1. Choose k + 1 of the n points to be fixed: this can be done in nCk+1 = n!/((k+1)!(n−k−1)!) ways.
2. Permute the remaining (n − k − 1) points in (n − k − 1)! ways.
3. Every permutation with at least k + 1 fixed points is counted at least once this way, so
the number of such permutations is at most n!/(k+1)!, and the probability is at most 1/(k+1)!.

We therefore have P(at most k fixed points) ≥ 1 − 1/(k+1)!.

Proving the bound is now easy: k < (k+1)! ⟹ 1/k > 1/(k+1)! ⟹ 1 − 1/k <
1 − 1/(k+1)!. Finally, P(at most k fixed points) ≥ 1 − 1/(k+1)! > 1 − 1/k. □
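The bound can be confirmed exhaustively for small n (a sanity check, not a proof; the ranges of n and k below are illustrative):

```python
# Exhaustive check: the fraction of permutations of [n] with at most k
# fixed points is at least 1 - 1/k, for all small n and 1 <= k <= n.
from itertools import permutations
from fractions import Fraction
from math import factorial

def prob_at_most_k_fixed(n, k):
    """Exact probability that a uniform permutation of [n] has <= k fixed points."""
    good = sum(
        1 for perm in permutations(range(n))
        if sum(perm[i] == i for i in range(n)) <= k
    )
    return Fraction(good, factorial(n))

checks = all(
    prob_at_most_k_fixed(n, k) >= 1 - Fraction(1, k)
    for n in range(1, 7)
    for k in range(1, n + 1)
)
```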
5. Consider a particle that does an unbiased random walk on the real line. It starts at position
0. For any i, if the particle is at i, it moves to position i + 1 with probability 1/2 and to
position i − 1 with probability 1/2. Prove that after n steps, with probability at least 1 − 10/√n, the
distance of the particle from the start, i.e., 0, is at most √(n ln n).
A: We can write the position as S_n = ΣX_i, where for each i, −1 ≤ X_i ≤ 1 and E[X_i] = 0.
The problem now reduces to bounding P(|S_n| ≤ √(n ln n)). We can simply use Hoeffding's
inequality, which gives

P(|S_n| ≥ √(n ln n)) ≤ 2e^{−2(n ln n)/(4n)} = 2e^{−(ln n)/2} = 2/√n.

The desired probability is then P(|S_n| ≤ √(n ln n)) ≥ 1 − 2/√n ≥ 1 − 10/√n. □
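A Monte Carlo sanity check of the concentration claim (the values of n and the trial count are illustrative, and this is evidence, not a proof):

```python
# After n steps of an unbiased +/-1 walk, |S_n| <= sqrt(n ln n) should hold
# with probability at least 1 - 2/sqrt(n); estimate that probability by simulation.
import math
import random

random.seed(0)
n = 400
trials = 2000
threshold = math.sqrt(n * math.log(n))

inside = sum(
    abs(sum(random.choice((-1, 1)) for _ in range(n))) <= threshold
    for _ in range(trials)
)
fraction_inside = inside / trials
```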
2 2 Marks
1. If X, Y are independent random variables then E[XY ] = E[X]E[Y ].
A: If X, Y are independent random variables, then the joint distribution factorizes
as P(X = x, Y = y) = P(X = x)P(Y = y). Now,

E[XY] = Σ_x Σ_y xy P(X = x, Y = y)
E[XY] = Σ_x Σ_y xy P(X = x)P(Y = y)
E[XY] = (Σ_x x P(X = x))(Σ_y y P(Y = y))
E[XY] = E[X]E[Y]. □
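A quick exact check with a product joint pmf (the marginals below are arbitrary choices):

```python
# Check E[XY] = E[X]E[Y] when the joint pmf is the product of the marginals.
from fractions import Fraction
from itertools import product

px = {0: Fraction(1, 3), 1: Fraction(2, 3)}
py = {-1: Fraction(1, 2), 2: Fraction(1, 2)}

# Independence: P(X=x, Y=y) = px[x] * py[y].
E_XY = sum(px[x] * py[y] * x * y for x, y in product(px, py))
E_X = sum(p * x for x, p in px.items())
E_Y = sum(p * y for y, p in py.items())
```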
2. E[X] = P (A)E[X|A] + P (Ac )E[X|Ac ]
A: Conditioning on A, by the law of total probability, P(X = x) = P(A)P(X = x | A) + P(A^c)P(X = x | A^c). Now,

E[X] = Σ_x x P(X = x)
E[X] = Σ_x x (P(A)P(X = x | A) + P(A^c)P(X = x | A^c))
E[X] = P(A) Σ_x x P(X = x | A) + P(A^c) Σ_x x P(X = x | A^c)
E[X] = P(A)E[X | A] + P(A^c)E[X | A^c].

This is as required. □
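A quick check of the identity on a fair six-sided die with A = "the roll is even":

```python
# Law of total expectation on a fair six-sided die, conditioning on parity.
from fractions import Fraction

outcomes = range(1, 7)
p = Fraction(1, 6)  # uniform pmf

E_X = sum(p * x for x in outcomes)
evens = [x for x in outcomes if x % 2 == 0]
odds = [x for x in outcomes if x % 2 == 1]

P_A = Fraction(len(evens), 6)                 # P(roll is even)
E_given_A = Fraction(sum(evens), len(evens))  # E[X | even]
E_given_Ac = Fraction(sum(odds), len(odds))   # E[X | odd]
total = P_A * E_given_A + (1 - P_A) * E_given_Ac
```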
3. If X1, X2, . . . , Xn are independent random variables then Var(ΣX_i) = ΣVar(X_i).
A: WLOG, E[X_i] = 0; we can make this assumption because we can always define Y_i = X_i − E[X_i],
which changes neither the variances nor the independence. Using the first problem,

Var(ΣX_i) = E[(ΣX_i)²]
= E[ΣX_i² + Σ_{i≠j} X_i X_j]
= ΣE[X_i²] + Σ_{i≠j} E[X_i X_j]
= ΣE[X_i²] + Σ_{i≠j} E[X_i]E[X_j]
= ΣE[X_i²] = ΣVar(X_i),

where the fourth line uses independence and the last line uses E[X_i] = 0. □
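An exact check for two independent variables, using the fact that the pmf of a sum of independent variables is the convolution of the marginals (the pmfs below are illustrative):

```python
# Exact check that Var(X1 + X2) = Var(X1) + Var(X2) for independent X1, X2.
from fractions import Fraction
from itertools import product

pmf1 = {0: Fraction(1, 2), 1: Fraction(1, 2)}
pmf2 = {-1: Fraction(1, 3), 2: Fraction(2, 3)}

def var(pmf):
    """Variance of a finitely supported pmf, computed exactly."""
    mean = sum(p * x for x, p in pmf.items())
    return sum(p * (x - mean) ** 2 for x, p in pmf.items())

# Independence: the pmf of X1 + X2 is the convolution of the marginals.
sum_pmf = {}
for (x1, p1), (x2, p2) in product(pmf1.items(), pmf2.items()):
    sum_pmf[x1 + x2] = sum_pmf.get(x1 + x2, Fraction(0)) + p1 * p2
```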
4. Prove Markov inequality.
A: For a non-negative random variable X and any c > 0, we start as follows:

E[X] = Σ_x x P(X = x)
≥ Σ_{x ≥ c} x P(X = x)
≥ c Σ_{x ≥ c} P(X = x)
= c P(X ≥ c)

Rearranging, P(X ≥ c) ≤ E[X]/c. Now, choosing c = aE[X] for some a > 0, we get
P(X ≥ aE[X]) ≤ E[X]/(aE[X]) = 1/a, as required. □
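A quick check of the bound P(X ≥ c) ≤ E[X]/c on a small non-negative pmf (values illustrative):

```python
# Empirical check of Markov's inequality on a non-negative pmf.
from fractions import Fraction

pmf = {0: Fraction(1, 2), 1: Fraction(1, 4), 4: Fraction(1, 4)}
E_X = sum(p * x for x, p in pmf.items())  # = 5/4

# For each threshold c, the exact tail P(X >= c) must not exceed E[X]/c.
markov_holds = all(
    sum(p for x, p in pmf.items() if x >= c) <= E_X / c
    for c in (Fraction(1), Fraction(2), Fraction(3), Fraction(4))
)
```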
5. Prove Chebyshev’s inequality.
A. First, we note that the event |X − E[X]| ≥ t is equivalent to the event (X − E[X])² ≥ t².
Now, using the transformation Y = (X − E[X])², we are interested in P(Y ≥ t²). By
definition, Y takes only non-negative values, so we can apply Markov's inequality:

P(Y ≥ t²) ≤ E[Y]/t² = Var(X)/t²,

as required. □
6. Show that pairwise independence does not imply independence.
A. We can show this directly with a counterexample. Consider a fair 4-sided die, and consider the events
A = {1, 2}, B = {2, 3} and C = {2, 4}. Clearly, P(A ∩ B) = P(A)P(B), P(A ∩ C) = P(A)P(C)
and P(B ∩ C) = P(B)P(C) (each side equals 1/4), so the events are pairwise independent; but
P(A ∩ B ∩ C) = 1/4 ≠ 1/8 = P(A)P(B)P(C), so they are not mutually independent. □
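The counterexample can be verified by direct enumeration:

```python
# Pairwise independence without mutual independence, checked by enumeration.
from fractions import Fraction

omega = {1, 2, 3, 4}
A, B, C = {1, 2}, {2, 3}, {2, 4}

def prob(event):
    """Probability of an event under the uniform measure on the 4-sided die."""
    return Fraction(len(event & omega), len(omega))

pairwise = (
    prob(A & B) == prob(A) * prob(B)
    and prob(A & C) == prob(A) * prob(C)
    and prob(B & C) == prob(B) * prob(C)
)
mutual = prob(A & B & C) == prob(A) * prob(B) * prob(C)
```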
7. Compare Markov, Chebyshev and Hoeffding Bounds.
A. Define the random variable X_i as 1 if the i-th throw is a 6 and 0 otherwise, and let S = ΣX_i. Clearly
S ∼ Binomial(N, 1/6). We know E[S] = N/6 and Var(S) = 5N/36 from properties of the
binomial distribution. Now,

1. Markov bound: P(S ≥ N/4) = P(S ≥ (3/2)E[S]) ≤ 2/3. The bound is independent of N.
2. Chebyshev bound: P(|S − E[S]| ≥ N/12) ≤ Var(S)/(N/12)² = (5N/36)/(N²/144) = 20/N. The bound becomes
stronger as N increases.
3. Hoeffding bound: P(|S − E[S]| ≥ N/12) ≤ 2e^{−N/72}. The bound decreases expo-
nentially as N increases.
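Evaluating the three bounds at a concrete N (the value 720 is illustrative) makes the ordering visible:

```python
# Compare the three tail bounds for S ~ Binomial(N, 1/6) at deviation N/12.
import math

def bounds(N):
    markov = 2 / 3                      # P(S >= (3/2) E[S]) <= 2/3, independent of N
    chebyshev = 20 / N                  # Var(S) / (N/12)^2 = 20/N
    hoeffding = 2 * math.exp(-N / 72)   # 2 exp(-2 (N/12)^2 / N)
    return markov, chebyshev, hoeffding

m, c, h = bounds(720)
```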
3 3 Marks
1. It is promised that a given coin is either fair (Pr(Head) = 1/2) or biased with Pr(Head) =
1/2 + ϵ where 0 < ϵ < 1/2. Show that 100/ϵ² coin tosses are sufficient to correctly determine the
type of coin (fair or biased) with at least 4/5 probability, i.e., give an algorithm that will
need at most 100/ϵ² coin tosses, and should have the following guarantee: if the coin is fair, the
algorithm will return ‘fair’ with probability at least 4/5, and if the coin is biased, the algorithm
will return ‘biased’ with probability at least 4/5.
A. Consider the following algorithm.

1. Define δ = ϵ/2. Toss the coin n = 100/ϵ² times, and let the i-th outcome be represented as X_i.
Clearly, if we define heads to be 1 and tails to be 0, then X_i ∈ [0, 1] and is therefore a bounded r.v.
2. Compute X = ΣX_i and p̂ = X/n. If p̂ > 0.5 + δ, declare the coin biased, else declare
the coin fair.
3. Note that p_fair = 0.5 and p_biased = 0.5 + ϵ.

I claim that this rule works, and the proof goes as follows.

1. Coin is fair: The strategy returns biased with probability at most P(|p̂ − p_fair| ≥
ϵ/2), which by Hoeffding's inequality is bounded by 2e^{−2n(ϵ/2)²} = 2e^{−2·(100/ϵ²)·(ϵ²/4)} = 2e^{−50}.
Therefore P(|p̂ − p_fair| ≥ ϵ/2) ≤ 2/e^{50} ≤ 1/5, and the strategy returns fair with
probability at least 4/5.
2. Coin is biased: The strategy returns fair with probability at most P(|p̂ − p_biased| ≥
ϵ/2), which by the same Hoeffding bound satisfies P(|p̂ − p_biased| ≥ ϵ/2) ≤ 2/e^{50} ≤ 1/5.
Therefore the strategy returns biased with probability at least 4/5. □
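A Monte Carlo sanity check of the rule (the value of ϵ and the number of repetitions are illustrative; this is evidence, not a proof):

```python
# Simulate the coin-testing rule: n = ceil(100/eps^2) tosses, threshold 0.5 + eps/2.
import math
import random

random.seed(1)
eps = 0.2
n = math.ceil(100 / eps ** 2)  # 2500 tosses

def classify(p_heads):
    """Run the rule once on a coin with the given head probability."""
    heads = sum(random.random() < p_heads for _ in range(n))
    return "biased" if heads / n > 0.5 + eps / 2 else "fair"

runs = 200
fair_ok = sum(classify(0.5) == "fair" for _ in range(runs)) / runs
biased_ok = sum(classify(0.5 + eps) == "biased" for _ in range(runs)) / runs
```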
2.
A. Let N = 100000 log(1/δ), and let the probability of correct estimation be p ≥ 2/3. Repeating the
algorithm N times, let X_i be the random variable that is 1 if the i-th run estimates the
mean to be within ϵ distance, and 0 otherwise. Consider S = ΣX_i. If we can show that
S ≥ N/2, we are done, because then more than half the runs are correct and the median necessarily
estimates the mean correctly. Now consider the following proof that, with N as defined, this holds
with probability at least 1 − δ.

Since p ≥ 2/3, we have E[S] ≥ 2N/3, so the event S < N/2 implies |S − E[S]| ≥ N/6, and hence
P(S < N/2) ≤ P(|S − E[S]| ≥ N/6) ≤ 2e^{−2(N/6)²/N} = 2e^{−N/18} by Hoeffding's inequality.
With N = 100000 log(1/δ), 2e^{−N/18} = 2δ^{100000/18} ≤ δ for any δ < 1/2.
This proves that the median is an incorrect estimate with probability at most δ. □
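The median-of-repetitions trick can be sketched in simulation; the base estimator below (the mean of 100 Bernoulli(1/2) samples, which lands within ϵ = 0.05 of the truth only about 2/3 of the time) is an illustrative stand-in for the algorithm being amplified:

```python
# Median-of-repetitions amplification (sketch): a base estimator that is
# within eps of the true mean with probability ~2/3 becomes far more
# reliable after taking the median of N independent runs.
import random
import statistics

random.seed(2)
true_mean = 0.5
eps = 0.05

def base_estimate():
    # Mean of 100 Bernoulli(1/2) samples: within eps of 1/2 only ~2/3 of the time.
    return sum(random.random() < true_mean for _ in range(100)) / 100

N = 99  # odd number of repetitions, so the median is itself a sample value
median_est = statistics.median(base_estimate() for _ in range(N))
```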
3.
A. Intuitively, the regret should simply scale by x; otherwise it would be possible to get a
better regret bound in the [0, 1] rewards case by simply scaling the rewards. The formal
proof goes as follows.

1. Let m be the number of exploration rounds for each arm, and T the total number of
rounds (fixed).
2. As rewards ∈ [0, x], Hoeffding's inequality takes the form P(|µ̂_a − µ_a| ≥ ϵ) ≤ 2e^{−2ϵ²m/x²}
for each arm a.
3. To make the RHS at most 2/T^10, we can set ϵ = x√(5 ln T/m). A union bound over the K arms
then gives P(Bad) ≤ 2K/T^10 ≤ 2/T^9 (for K ≤ T), where Bad is the event that |µ̂_a − µ_a| ≥ ϵ
for some arm a.
4. Now, E[R(T)] = P(Bad)E[R(T)|Bad] + P(Good)E[R(T)|Good], where Good is the complement
of the Bad event described in the previous bullet. Taking the worst-case regret xT in the Bad
case, E[R(T)] ≤ 2x/T^8 + E[R(T)|Good].
5. To calculate E[R(T)|Good], we charge a regret of at most x per round during the exploration
phase and µ* − µ_a for the rest of the rounds, where a is the arm chosen after exploration
(so that, on Good, µ* − µ_a ≤ 2ϵ):

E[R(T)|Good] ≤ mKx + (T − mK)(µ* − µ_a)
mKx + (T − mK)(µ* − µ_a) ≤ mKx + T(µ* − µ_a) ≤ mKx + 2Tϵ
E[R(T)|Good] ≤ mKx + 2Tx√(5 ln T/m) = x(mK + 2T√(5 ln T/m))

This is just the regret in the [0, 1] case multiplied by x, so optimizing m as in that case,
the regret bound is O(xT^{2/3}K^{1/3}(log T)^{1/3}). □
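The Good-event bound above can be written as a function to confirm that it scales linearly in the reward range x (the values of T, K and the choice of m below are illustrative):

```python
# The Good-event regret bound x*(m*K + 2*T*sqrt(5 ln T / m)) scales linearly in x.
import math

def regret_bound(x, T, K, m):
    """Upper bound on expected regret under the Good event."""
    return x * (m * K + 2 * T * math.sqrt(5 * math.log(T) / m))

T, K = 10_000, 10
m = int((T / K) ** (2 / 3) * math.log(T) ** (1 / 3))  # standard explore-then-commit choice
scales_linearly = math.isclose(regret_bound(3.0, T, K, m),
                               3.0 * regret_bound(1.0, T, K, m))
```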
4.
A.

a. If both arms have a mean of 1/2, then the regret of choosing any arm is 0, so the best
upper bound is 0. We can set the number of exploration rounds to 0 in this case.
b. Under the Good event, the maximum regret is 1000m·(ln T)^{1/3}/T^{1/3} + (T − 2m)·1000·(ln T)^{1/3}/T^{1/3},
which at m = T/3 becomes O(T^{2/3}(ln T)^{1/3}). P(Bad) can be made vanishingly small.
c. If one arm has a mean of 1/2 and the other of 1/2 + 1/√T, then under the Good event the
regret even if the smaller arm is chosen is m/√T + P(the 1/2 arm is chosen)·(T − 2m)/√T =
O(T^{1/2}). The probability of Bad can be made arbitrarily small. The regret bound then
is O(T^{1/2}). □