Mostly Harmless Statistics Formula Packet: Chapter 3 Formulas
Mostly Harmless Statistics Formula Packet: Chapter 3 Formulas
Chapter 3 Formulas
∑𝑥 ∑𝑥
Sample Mean: 𝑥̅ = Population Mean: 𝜇 =
𝑛 𝑁
∑(𝑥𝑤)
Weighted Mean: 𝑥̅ = ∑𝑤
Range = Max – Min
∑(𝑥−𝑥̅ )2
Sample Standard Deviation: 𝑠 = √ Population Standard Deviation = σ
𝑛−1
∑(𝑥−𝑥̅ )2
Sample Variance: 𝑠 2 = Population Variance = σ2
𝑛−1
𝑠 𝑥−𝑥̅
Coefficient of Variation: CVar = ( ∙ 100) % Z-Score: 𝑧 =
𝑥̅ 𝑠
(𝑛+1)∙𝑝
Percentile Index: 𝑖 = Interquartile Range: IQR = Q3 – Q1
100
1
Chebyshev’s Inequality: ((1 − (𝑧)2) ∙ 100) % Outlier Upper Limit: Q3 + (1.5·IQR)
TI-84: Enter the data in a list and then press [STAT]. Use cursor keys to highlight CALC. Press 1 or [ENTER] to
select 1:1-Var Stats. Press [2nd], then press the number key corresponding to your data list. Press [Enter] to
calculate the statistics. Note: the calculator always defaults to L1 if you do not specify a data list.
sx is the sample standard deviation. You can arrow down and find more statistics. Use the min and max to calculate
the range by hand. To find the variance simply square the standard deviation.
Press [F4], select 1: 1-Var Stats. To get the list name to the List box, press [2nd] [Var-
Link], arrow down to list1 and press [Enter]. This will bring list1 to the List box. Press
[Enter] to enter the list name and then enter again to calculate.
3 4 5 6 7 8 9
4 5 6 7 8 9 10
5 6 7 8 9 10 11
6 7 8 9 10 11 12
Chapter 5 Formulas
Discrete Distribution Table:
Discrete Distribution Mean: μ = Σ(xi ∙ P(xi))
0 ≤ P(xi) ≤ 1 ∑ P(xi) = 1
Discrete Distribution Variance: Discrete Distribution Standard Deviation:
σ2 = ∑(xi2∙P(xi)) – μ2 σ = √𝜎 2
1
Geometric Distribution Mean: 𝜇 =
𝑝
1−𝑝
Geometric Distribution: Variance: σ2 =
𝑝2
P(X = x) = p ∙ q(x – 1), x = 1, 2, 3, …
1−𝑝
Standard Deviation: σ = √
𝑝2
Binomial Distribution Mean: μ = n ∙ p
Binomial Distribution: Variance: σ2 = n ∙ p ∙ q
P(X = x) = nCx·px·q(n-x), x = 0, 1, 2, … , n
Standard Deviation: σ = √𝑛 ∙ 𝑝 ∙ 𝑞
Hypergeometric Distribution: p = P(success) q = P(failure) = 1 – p
𝐶 ∙ 𝐶
P(X = x) = 𝑎 𝑥 𝑏 𝑛−𝑥 n = sample size N = population size
𝑁𝐶𝑛
Chapter 6 Formulas
Uniform Distribution:
1 Exponential Distribution:
𝑓(𝑥) = , for 𝑎 ≤ 𝑥 ≤ 𝑏 1
𝑏−𝑎
1 𝑓(𝑥) = 𝑒 (−𝑥/𝜇) , for 𝑥 ≥ 0
𝜇
P(X ≥ x) = P(X > x) = ( ) ∙ (𝑏 − 𝑥)
𝑏−𝑎 P(X ≥ x) = P(X > x) = e–x/μ
1
P(X ≤ x) = P(X < x) = ( ) ∙ (𝑥 − 𝑎) P(X ≤ x) = P(X < x) = 1 – e–x/μ
𝑏−𝑎
P(x1 ≤ X ≤ x2) = P(x1 < X < x2) = P(x1 ≤ X ≤ x2) = P(x1 < X < x2) =
1 𝑒 (−𝑥1 /𝜇) − 𝑒 (−𝑥2 /𝜇)
( ) ∙ (𝑥2 − 𝑥1 )
𝑏−𝑎
Standard Normal Distribution:
μ = 0, σ = 1 Central Limit Theorem:
𝑥−𝜇 𝑥̅ −𝜇
z-score: 𝑧 = Z-score: 𝑧 = 𝜎
𝜎 ( )
√𝑛
x = zσ + μ
Excel
Excel Finding a Probability Excel
Finding a Probability =NORM.DIST(x2,µ,σ,true) – Finding a Probability
=NORM.DIST(x,µ,σ,true) NORM.DIST(x1,µ,σ,true) =1–NORM.DIST(x,µ,σ,true)
Finding a Percentile Finding a Percentile Finding a Percentile
=NORM.INV(area,µ,σ) x1 =NORM.INV((1–area)/2,µ,σ) =NORM.INV(1–area,µ,σ)
x2 =NORM.INV(1–((1–area)/2),μ,σ)
TI Calculator
TI Calculator TI Calculator
Finding a Probability
Finding a Probability Finding a Probability
=normalcdf(x1,x2,µ,σ)
=normalcdf(-1E99,x,µ,σ) =normalcdf(x,1E99,µ,σ)
Finding a Percentile
Finding a Percentile Finding a Percentile
x1 =invNorm((1–area)/2,µ,σ)
=invNorm(area,µ,σ) =invNorm(1–area,µ,σ)
x2 =invNorm(1–((1–area)/2),μ,σ)
Chapter 7 Formulas
Confidence Interval for One Proportion Sample Size for Proportion
𝑧 ⁄ 2
𝑝̂𝑞̂
𝑝̂ ± 𝑧𝛼/2 √( ) 𝑛 = 𝑝∗ ∙ 𝑞 ∗ ( 𝛼 2 )
𝐸
𝑛
𝑥 Always round up to whole number.
𝑝̂ = 𝑞̂ = 1 − 𝑝̂ If p is not given use p* = 0.5.
𝑛
TI-84: 1-PropZInt E = Margin of Error
Confidence Interval for One Mean z- Confidence Interval
Use z-interval when σ is given. 𝜎
𝑥̅ ± 𝑧𝛼/2 ( )
Use t-interval when s is given. √𝑛
If n < 30, population needs to be normal. TI-84: ZInterval
Z-Critical Values t-Critical Values
Excel: zα/2 =NORM.INV(1–area/2,0,1) Excel: tα/2 =T.INV(1–area/2,df)
TI-84: zα/2 = invNorm(1–area/2,0,1) TI-84: tα/2 = invT(1–area/2,df)
t-Confidence Interval, df = n – 1; TI-84: TInterval 𝑧 ⁄ ∙𝜎 2
𝑠
𝑥̅ ± 𝑡𝛼⁄2 ( ) Sample Size for Mean 𝑛 = ( 𝛼 2 )
𝐸
√𝑛
Always round up to whole number.
TI-84: TI-84:
Two-tail: zα/2 = invNorm(1–α/2,0,1) Two-tail: tα/2 = invT(1–α/2,df)
Right-tail: z1–α = invNorm(1–α,0,1) Right-tail: t1–α = invT(1–α,df)
Left-tail: zα = invNorm(α,0,1) Left-tail: tα = invT(α,df)
Hypothesis Test for One Proportion
H0: p = p0 Rejection Rules:
H1: p ≠ p0 P-value method: reject H0 when the p-value ≤ α.
𝑝̂−𝑝
𝑧 = 𝑝 𝑞0 Critical value method: reject H0 when the test statistic
√( 0 0 ) is in the critical region (shaded tails).
𝑛
TI-84: 1-PropZTest
TI-84: 2-SampZTest
Hypothesis Test for Two Independent Means Confidence Interval for Two Independent Means
H0: µ1 = µ2 T-Interval: Assume variances are unequal
H1: µ1 ≠ µ2 𝑠12 𝑠22
T-Test: Assume variances are unequal (𝑥̅1 − 𝑥̅2 ) ± 𝑡𝛼⁄2 √( + )
𝑛1 𝑛2
(𝑥̅ −𝑥̅ )−(𝜇1 −𝜇2 )0
𝑡= 1 2 TI-84: 2-SampTInt
𝑠 2𝑠 2 2 2 2
√( 1 + 2 ) 𝑠 𝑠
( 1+ 2)
𝑛1 𝑛2 𝑛1 𝑛2
𝑑𝑓 =
TI-84: 2-SampTTest 𝑠2
2
1 𝑠2
2
1
2 2 2 (( 1 ) ( )+( 2 ) ( ))
𝑠 𝑠 𝑛1 𝑛1 −1 𝑛2 𝑛2 −1
( 1+ 2)
𝑛1 𝑛2
𝑑𝑓 = 2 2
T-Interval: Assume variances are equal
𝑠2 1 𝑠2 1
(( 1 ) ( )+( 2 ) ( ))
𝑛1 𝑛1 −1 𝑛2 𝑛2 −1 (𝑛1 −1)𝑠12 +(𝑛2 −1)𝑠22 1 1
(𝑥̅1 − 𝑥̅2 ) ± 𝑡𝛼⁄2 √(( )( + ))
(𝑛1 +𝑛2 −2) 𝑛1 𝑛2
T-Test: Assume variances are equal
(𝑥̅1 −𝑥̅2 )−(𝜇1 −𝜇2 )
𝑡= df = n1 – n2 – 2
2
(𝑛 −1)𝑠 +(𝑛2 −1)𝑠2 2
1 1
√( 1 (𝑛 1 )( + )
1 +𝑛2 −2) 𝑛1 𝑛2
df = n1 – n2 – 2
Hypothesis Test for Two Proportions Confidence Interval for Two Proportions
H0: p1 = p2 𝑝̂1 𝑞̂1 𝑝̂2 𝑞̂2
(𝑝̂1 − 𝑝̂2 ) ± 𝑧𝛼/2 √( + )
H1: p1 ≠ p2 𝑛1 𝑛2
(𝑝̂ −𝑝̂ )−(𝑝1 −𝑝2 ) 𝑥1 𝑥2
𝑧= 1 2 𝑝̂1 =
𝑛1
𝑝̂2 =
𝑛2
1 1
√(𝑝̂∙𝑞̂( + )) 𝑞̂1 = 1 − 𝑝̂1 𝑞̂2 = 1 − 𝑝̂2
𝑛1 𝑛2
(𝑥 +𝑥2 ) (𝑝̂1 ∙𝑛1 +𝑝̂2 ∙𝑛2 ) TI-84: 2-PropZInt
𝑝̂ = (𝑛1 =
1 +𝑛2 ) (𝑛1 +𝑛2 )
𝑥 𝑥2
𝑞̂ = 1 − 𝑝̂ 𝑝̂1 = 1 𝑝̂2 =
𝑛 1 𝑛2
TI-84: 2-PropZTest
Hypothesis Test for Two Variances Hypothesis Test for Two Standard Deviations
𝐻0 : 𝜎12 = 𝜎22 𝐻0 : 𝜎1 = 𝜎2
𝐻1 : 𝜎12 ≠ 𝜎22 𝐻1 : 𝜎1 ≠ 𝜎2
𝑠12 𝑠12
𝐹= dfN = n1 – 1, dfD = n2 – 1 𝐹= dfN = n1 – 1, dfD = n2 – 1
𝑠22 𝑠22
Chapter 11 Formulas
One-Way ANOVA: H0: µ1 = µ2 = µ3 = … = µk k = number of groups
H1: At least one mean is different.
Source SS = Sum of Squares df MS = Mean Square F
𝑆𝑆𝐵 𝑀𝑆𝐵
Between (Factor) ∑ 𝑛𝑖 (𝑥̅𝑖 − 𝑥̅𝐺𝑀 )2 k–1 𝑀𝑆𝐵 = 𝐹=
𝑘−1 𝑀𝑆𝑊
𝑆𝑆𝑊
Within (Error) ∑(𝑛𝑖 − 1)𝑠𝑖2 N–k 𝑀𝑆𝑊 =
𝑁−𝑘
Total SST N–1
𝑥̅𝑖 = sample mean from the ith group 𝑠𝑖2 = sample variance from the ith group
∑𝑥
ni = sample size of the ith group N = n1 + n2 + ∙∙∙ + nk 𝑥̅𝐺𝑀 = 𝑖
𝑁
𝑥̅𝑖 −𝑥̅𝑗 H0 : μi = μj
Bonferroni test statistic: 𝑡 =
1 1
H1 : μi ≠ μj
√(𝑀𝑆𝑊( + ))
𝑛𝑖 𝑛𝑗
Column Effect (Factor B): H0: The column variable has no effect on the average .
H1: The column variable has an effect on the average .
Source SS df MS F
𝑆𝑆𝐴 𝑀𝑆𝐴
A (row factor) SSA a–1 𝑀𝑆𝐴 =
𝑑𝑓𝐴
𝐹𝐴 =
𝑀𝑆𝐸
𝑆𝑆𝐵 𝑀𝑆𝐵
B (column factor) SSB b–1 𝑀𝑆𝐵 =
𝑑𝑓𝐵
𝐹𝐵 =
𝑀𝑆𝐸
𝑆𝑆𝐴×𝐵 𝑀𝑆𝐴×𝐵
A×B (interaction) SSA×B (a – 1) (b – 1) 𝑀𝑆𝐴×𝐵 =
𝑑𝑓𝐴×𝐵
𝐹𝐴×𝐵 =
𝑀𝑆𝐸
𝑆𝑆𝐸
Error (within) SSE ab(n – 1) 𝑀𝑆𝐸 =
𝑑𝑓𝐸
Total SST N–1
Chapter 12 Formulas
SSxx = (𝑛 − 1)𝑠𝑥2 Correlation Coefficient
SSyy=(𝑛 − 1)𝑠𝑦2 𝑟=
𝑆𝑆𝑥𝑦