Lecture 09 Anova
BIOSTATISTICS
ANOVA
LECTURE # 09
BY,
Graphical demonstration: Employing two types of variability
ANOVA – NULL AND ALTERNATIVE HYPOTHESES
Suppose the sample contains k independent groups.
1. ANOVA tests the null hypothesis
H0: μ1 = μ2 = … = μk
• That is, “the group means are all equal”
2. The alternative hypothesis is
H1: μi ≠ μj for some i ≠ j
• Or, “the group means are not all equal”
• k = number of treatments (groups), n = number of observations per treatment
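3. ANOVA Table
Source of variation | Sum of squares | df | MSS (mean sum of squares) | F
Between groups (SSB) | $\frac{1}{n}\sum_{i=1}^{k} T_{i.}^{2} - \frac{(T_{..})^{2}}{nk}$ | k - 1 | MSSB = SSB/df | MSSB/MSSW
Within groups (SSW) | TSS - SSB | k(n - 1) | MSSW = SSW/df |
Total (TSS) | $\sum_{j=1}^{n}\sum_{i=1}^{k} X_{ij}^{2} - \frac{(T_{..})^{2}}{nk}$ | kn - 1 | |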
4. Level of significance: α = ?
5. Critical region: reject H0 if F > Fα(ν1, ν2), where
ν1 = k-1
ν2 = k(n-1)
          A     B     C     D     E   Total
         21    35    45    32    45
         35    12    60    53    29
         32    27    33    29    31
         28    41    36    42    22
         14    19    31    40    36
         47    23    40    23    29
         25    31    43    35    42
         38    20    48    42    30
Total   240   208   336   296   264    1344
Mean     30    26    42    37    33     168
H0 : µA = µB = µC = µD = µE (there is no significant difference among the 5 popular
brands of cigarettes)
HA : At least two of the brand means are not the same (there is a significant difference
among the 5 popular brands of cigarettes)
α = 0.05
TSS = $\sum_{j=1}^{n}\sum_{i=1}^{k} X_{ij}^{2} - \frac{(T_{..})^{2}}{nk}$
= [ (21)^2 + (35)^2 + (45)^2 + (32)^2 + (45)^2 + (35)^2 + (12)^2 + (60)^2 + (53)^2 + (29)^2 + (32)^2 + (27)^2 +
(33)^2 + (29)^2 + (31)^2 + (28)^2 + (41)^2 + (36)^2 + (42)^2 + (22)^2 + (14)^2 + (19)^2 + (31)^2 + (40)^2 + (36)^2 +
(47)^2 + (23)^2 + (40)^2 + (23)^2 + (29)^2 + (25)^2 + (31)^2 + (43)^2 + (35)^2 + (42)^2 + (38)^2 + (20)^2 + (48)^2 +
(42)^2 + (30)^2 ] − (1344)^2 / 40
= 49370 − 1806336 / 40
= 49370 − 45158.4
TSS = 4211.6
SSB = $\frac{1}{n}\sum_{i=1}^{k} T_{i.}^{2} - \frac{(T_{..})^{2}}{nk}$
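The remaining quantities for the cigarette example follow from the same totals. The sketch below is one way to carry the arithmetic through in Python, assuming the data table is read column-wise (one list per brand). The variable names (`brands`, `correction`, `f_cal`) are illustrative and not part of the lecture.

```python
# Observations per brand, read column-wise from the data table (assumption).
brands = {
    "A": [21, 35, 32, 28, 14, 47, 25, 38],
    "B": [35, 12, 27, 41, 19, 23, 31, 20],
    "C": [45, 60, 33, 36, 31, 40, 43, 48],
    "D": [32, 53, 29, 42, 40, 23, 35, 42],
    "E": [45, 29, 31, 22, 36, 29, 42, 30],
}

k = len(brands)                # number of treatments
n = len(brands["A"])           # observations per treatment (equal n)

grand_total = sum(sum(obs) for obs in brands.values())   # T..
correction = grand_total ** 2 / (n * k)                  # (T..)^2 / nk

# Shortcut formulas from the ANOVA table
tss = sum(x ** 2 for obs in brands.values() for x in obs) - correction
ssb = sum(sum(obs) ** 2 for obs in brands.values()) / n - correction
ssw = tss - ssb

df_between = k - 1
df_within = k * (n - 1)
mssb = ssb / df_between
mssw = ssw / df_within
f_cal = mssb / mssw

print(f"TSS = {tss:.1f}, SSB = {ssb:.1f}, SSW = {ssw:.1f}")
print(f"df = ({df_between}, {df_within}), MSSB = {mssb:.2f}, MSSW = {mssw:.2f}")
print(f"F = {f_cal:.2f}")
```

Under these assumptions F comes out at about 3.59 on (4, 35) degrees of freedom, which is then compared with the tabulated f0.05(4, 35).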
CONCLUSION:
Fcal lies in the critical region, so we reject the null hypothesis and conclude that at
least two means are not the same, i.e., there is a significant difference among the 5
popular brands of cigarettes.
Critical Values of the F Distribution: f0.05(ν1, ν2)
Critical Values of the F Distribution: f0.01(ν1, ν2)
PROBLEM # 02:
• Question # 03 on page 400 of the book “Introduction to Statistics” by Ronald E.
Walpole, 3rd edition.
Six different machines are being considered for use in manufacturing
rubber seals. The machines are being compared with respect to the tensile
strength of the product. A random sample of 4 seals from each machine is
used to determine whether or not the mean tensile strength varies from
machine to machine. The following are the tensile strength measurements
in kilograms per square centimeter × 10^-1.
Perform the analysis of variance at the 0.05 level of significance and
indicate whether or not the treatment means differ significantly.
MACHINES Total
1 2 3 4 5 6
Total
Mean
CONCLUSION:
Fcal lies in the acceptance region, so we accept the null hypothesis and conclude that the
means are the same, i.e., there is no significant difference in tensile strength from
machine to machine.
ANOVA Table For Unequal Number of Observations
Source of variation | Sum of squares | df | MSS (mean sum of squares) | F
Between groups (SSB) | $\sum_{i=1}^{k} \frac{T_{i.}^{2}}{n_i} - \frac{(T_{..})^{2}}{N}$ | k - 1 | MSSB = SSB/df | MSSB/MSSW
Within groups (SSW) | TSS - SSB | Σni - k | MSSW = SSW/df (MSE) |
Total (TSS) | $\sum_{i=1}^{k}\sum_{j=1}^{n_i} X_{ij}^{2} - \frac{(T_{..})^{2}}{N}$ | Σni - 1 | |
Here N = Σni is the total number of observations.
• Critical region:
ν1 = k-1
ν2 = Σni - k
PROBLEM # 03:
• Example # 02 on page 394 of the book “Introduction to Statistics” by
Ronald E. Walpole, 3rd edition.
It is suspected that higher-priced automobiles are assembled
with greater care than lower-priced automobiles. To investigate
whether there is any basis for this feeling, a large luxury model
A, a medium-size sedan B and a subcompact hatchback C were
compared for defects when they arrived at the dealer’s showroom.
All cars were manufactured by the same company. The numbers
of defects for several cars of each of the three models are recorded in the table.
Test the hypothesis at the 0.05 level of significance that the
average number of defects is the same for the three models.
NUMBERS OF AUTOMOBILE DEFECTS
MODELS
          A     B     C   TOTAL
          4     5     8
          7     1     6
          6     3     8
          6     5     9
          -     3     5
          -     4     -
TOTAL
MEAN
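The unequal-sample-size formulas can be applied to this table as follows. This is a minimal sketch, assuming the flattened table reads column-wise as A = 4, 7, 6, 6; B = 5, 1, 3, 5, 3, 4; C = 8, 6, 8, 9, 5. The function name `one_way_anova` and its return layout are hypothetical helpers, not something defined in the lecture.

```python
def one_way_anova(groups):
    """One-way ANOVA by the shortcut formulas; group sizes may differ."""
    k = len(groups)
    sizes = [len(g) for g in groups]          # n_i
    totals = [sum(g) for g in groups]         # T_i.
    big_n = sum(sizes)                        # N = sum of n_i
    correction = sum(totals) ** 2 / big_n     # (T..)^2 / N

    tss = sum(x ** 2 for g in groups for x in g) - correction
    ssb = sum(t ** 2 / n_i for t, n_i in zip(totals, sizes)) - correction
    ssw = tss - ssb

    df_between, df_within = k - 1, big_n - k
    f_cal = (ssb / df_between) / (ssw / df_within)
    return {"TSS": tss, "SSB": ssb, "SSW": ssw,
            "df": (df_between, df_within), "F": f_cal}


# Column assignment below is an assumption about the flattened table.
model_a = [4, 7, 6, 6]
model_b = [5, 1, 3, 5, 3, 4]
model_c = [8, 6, 8, 9, 5]

result = one_way_anova([model_a, model_b, model_c])
for name, value in result.items():
    print(name, value)
```

With this reading of the table, F is about 8.49 on (2, 12) degrees of freedom, to be compared with the tabulated f0.05(2, 12).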
CONCLUSION:
Fcal lies in the critical region, so we reject the null hypothesis and conclude that at least two means are not the same, i.e.,
there is a significant difference in the average number of defects among the three models of automobile.
PROBLEM # 04:
• Question # 04 on page 400 of the book “Introduction to Statistics” by Ronald
E. Walpole, 3rd edition.
Three sections of the same elementary mathematics course are taught by
3 teachers. The final grades were recorded as follows:
Is there a significant difference in the average grades given by the 3
teachers? Use a 0.05 level of significance.
TEACHERS
          A     B     C   TOTAL
         73    88    68
         89    78    79
         82    48    56
         43    91    91
         80    51    71
         73    85    71
         66    74    87
         60    77    41
         45    31    59
         93    78    68
         36    62    53
         77    76    79
          -    96    15
          -    80     -
          -    56     -
TOTAL
MEAN
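As a cross-check on the shortcut formulas, the two types of variability can also be computed directly from deviations: between-group variability from the group means around the grand mean, and within-group variability from each grade around its own group mean. The sketch below uses this deviation form (an alternative to the table's shortcut formulas, giving the same sums of squares) and assumes the flattened table reads column-wise with 12, 15 and 13 grades for teachers A, B and C. All variable names are illustrative.

```python
from statistics import fmean

# Column assignment is an assumption about the flattened table.
grades = {
    "A": [73, 89, 82, 43, 80, 73, 66, 60, 45, 93, 36, 77],
    "B": [88, 78, 48, 91, 51, 85, 74, 77, 31, 78, 62, 76, 96, 80, 56],
    "C": [68, 79, 56, 91, 71, 71, 87, 41, 59, 68, 53, 79, 15],
}

k = len(grades)
big_n = sum(len(g) for g in grades.values())                 # N = sum of n_i
grand_mean = fmean([x for g in grades.values() for x in g])

# Between groups: n_i * (group mean - grand mean)^2, summed over groups
ssb = sum(len(g) * (fmean(g) - grand_mean) ** 2 for g in grades.values())
# Within groups: (grade - its own group mean)^2, summed over all grades
ssw = sum((x - fmean(g)) ** 2 for g in grades.values() for x in g)

df_between, df_within = k - 1, big_n - k
f_cal = (ssb / df_between) / (ssw / df_within)

print(f"SSB = {ssb:.2f}, SSW = {ssw:.2f}, TSS = {ssb + ssw:.2f}")
print(f"df = ({df_between}, {df_within}), F = {f_cal:.2f}")
```

Under these assumptions F is about 0.46 on (2, 37) degrees of freedom. Since every 5% critical value with ν1 = 2 is at least 3.00, this falls in the acceptance region.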
CONCLUSION:
Fcal lies in the acceptance region, so we accept the null hypothesis and conclude
that the means are the same, i.e., there is no significant difference
between the average grades given by the three teachers.