ANOVA:
Analysis of Variance
ANOVA:
Learning Objectives
• Underlying methodological principles of ANOVA
• Statistical principles: partitioning of variability
• Summary table for one-way ANOVA
t-Test vs ANOVA:
The Case for Multiple Groups
• t-tests can be used to compare mean differences for two groups
• Between-subject designs
• Within-subject designs
• Paired-samples designs
• The test allows us to judge whether or not the observed differences are likely to have occurred by chance.
• Although helpful, the t-test has its problems
Multiple Groups
• Two groups are often insufficient
• No difference between Therapy X and a control: are other therapies effective?
• 0.05 mg/l alcohol does not decrease memory ability relative to a control: what about other doses?
• What if one control group is not enough?
Multiple Groups 2
• With multiple groups we could make multiple comparisons using t-tests
• Problem: we would expect some differences in means to be significant by chance alone
• How would we know which ones to trust?
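The problem can be quantified: if each of m t-tests uses α = .05, the chance of at least one spurious "significant" result grows quickly with m. A minimal sketch (treating the tests as independent, which is a simplification):

```python
# Family-wise error rate when running many pairwise t-tests at alpha = .05.
# With k groups there are k*(k-1)/2 pairwise comparisons; assuming the
# tests were independent (a simplification), the chance of at least one
# false positive is 1 - (1 - alpha)^m.
alpha = 0.05

for k in (3, 5, 10):
    m = k * (k - 1) // 2                 # number of pairwise t-tests
    fwer = 1 - (1 - alpha) ** m          # P(at least one false positive)
    print(f"{k} groups -> {m} t-tests -> P(false positive) = {fwer:.2f}")
# → 3 groups: 0.14, 5 groups: 0.40, 10 groups: 0.90
```

Even with only 5 groups, the chance of at least one chance "finding" is around 40%, which is exactly why an omnibus test is preferable.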
Comparing Multiple Groups: The
“One-Way” Design
Alcohol Level (Dose in mg/kg): 0 (Control), 10, 20 — one group of subjects measured at each dose
• Independent variable (“factor”): alcohol dose
• Dependent variable (“measurement”): the score recorded for each subject
• Analysis: variation between and within conditions
Analysis of Variance (F test):
Advantages
• Provides an omnibus test, avoiding multiple t-tests and spurious significant results
• It is more stable, since it relies on all of the data
• Recall from your work on standard errors and t-tests: the smaller the sample, the less stable the population parameter estimates.
What does ANOVA do?
• Provides an F ratio that has a known underlying distribution, which we use to determine statistical significance between groups (just like a t-test or a z-test)
• e.g., take an experiment in which subjects are randomly allocated to 3 groups
• The means and standard deviations will all be different from each other
• We expect this because that is the nature of sampling (as you know!)
The question is …
are the groups more different than
we would expect by chance?
How does ANOVA work?
• Instead of dealing with means as data points we deal with variation
• There is variation (variance) within groups (data)
• There is variance between group means (experimental effect)
• If the groups are equivalent, then the variance between and within groups will be equal.
• Expected variation is used to calculate statistical significance in the same way that expected differences in means are used in t-tests or z-tests
The basic ANOVA situation
Two variables: 1 categorical, 1 quantitative
Main question: does the mean of the quantitative variable depend on which group (given by the categorical variable) the individual is in?
If the categorical variable has only 2 values:
• 2-sample t-test
ANOVA allows for 3 or more groups
An example ANOVA situation
Subjects: 25 patients with blisters
Treatments: Treatment A, Treatment B, Placebo
Measurement: # of days until blisters heal
Data [and means]:
• A: 5,6,6,7,7,8,9,10 [7.25]
• B: 7,7,8,9,9,10,10,11 [8.875]
• P: 7,9,9,10,10,10,11,12,13 [10.11]
Are these differences significant?
Informal Investigation
Graphical investigation:
• side-by-side box plots
• multiple histograms
Whether the differences between the groups are
significant depends on
• the difference in the means
• the standard deviations of each group
• the sample sizes
ANOVA determines P-value from the F statistic
Side-by-Side Boxplots
[Figure: side-by-side boxplots of days to heal (y-axis, roughly 7 to 13) for treatments A, B, and P (x-axis).]
What does ANOVA do?
At its simplest (there are extensions) ANOVA tests the
following hypotheses:
H0: The means of all the groups are equal.
Ha: Not all of the means are equal
(the alternative doesn’t say how or which ones differ; we can follow up with “multiple comparisons”)
Note: we usually refer to the sub-populations as
“groups” when doing ANOVA.
Assumptions of ANOVA
• Each group is approximately normal
• Check this by looking at histograms, or rely on reasonable assumptions about the population
• ANOVA can handle some non-normality, but not severe outliers
• Standard deviations of each group are approximately equal
• Rule of thumb: the ratio of the largest to the smallest sample standard deviation should be less than 2:1
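The 2:1 rule of thumb is easy to check directly. A sketch using the blister data from the earlier example and Python's stdlib statistics module:

```python
import statistics

# Sample standard deviation of each group (blister data from earlier).
groups = {
    "A": [5, 6, 6, 7, 7, 8, 9, 10],
    "B": [7, 7, 8, 9, 9, 10, 10, 11],
    "P": [7, 9, 9, 10, 10, 10, 11, 12, 13],
}

sds = {name: statistics.stdev(g) for name, g in groups.items()}
ratio = max(sds.values()) / min(sds.values())   # largest/smallest ≈ 1.21

for name, sd in sds.items():
    print(f"{name}: sd = {sd:.3f}")
print(f"largest/smallest = {ratio:.2f}  (rule of thumb: should be < 2)")
```

Here the ratio is well under 2, so the equal-spread assumption looks reasonable for these data.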
Normality Check
We should check for normality using:
• assumptions about population
• histograms for each group
With such small data sets, there really isn’t a
really good way to check normality from data,
but we make the common assumption that
physical measurements of people tend to be
normally distributed.
Notation for ANOVA
• n = number of individuals altogether
• k = number of groups
• x̄ = mean of the entire data set
Group i has:
• nᵢ = number of individuals in group i
• xᵢⱼ = value for individual j in group i
• x̄ᵢ = mean of group i
• sᵢ = standard deviation of group i
How ANOVA works (outline)
ANOVA measures two sources of variation in the data and compares their
relative sizes
• variation BETWEEN groups
• for each data value look at the difference between its
group mean and the overall mean
• variation WITHIN groups
• for each data value we look at the difference
between that value and the mean of its group
The ANOVA F-statistic is the ratio of the Between-Group Variation to the Within-Group Variation:

F = Between / Within = MSB / MSW

A large F is evidence against H0, since it indicates that there is more difference between groups than within groups.
How are these computations made?
We want to measure the amount of variation due to BETWEEN-group variation and WITHIN-group variation.
For each data value, we calculate its contribution to:
• BETWEEN-group variation: (x̄ᵢ − x̄)²
• WITHIN-group variation: (xᵢⱼ − x̄ᵢ)²
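Summing these two kinds of contributions for the blister data gives SSB and SSW, and from those the F ratio. A minimal pure-Python sketch of the computation:

```python
# One-way ANOVA for the blister data, computed from the two sums of
# squares defined above (pure Python, no libraries).
groups = {
    "A": [5, 6, 6, 7, 7, 8, 9, 10],
    "B": [7, 7, 8, 9, 9, 10, 10, 11],
    "P": [7, 9, 9, 10, 10, 10, 11, 12, 13],
}

values = [x for g in groups.values() for x in g]
n, k = len(values), len(groups)          # n = 25 individuals, k = 3 groups
grand_mean = sum(values) / n             # x-bar = 8.8

# BETWEEN-group variation: each value contributes (group mean - grand mean)^2
ssb = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups.values())
# WITHIN-group variation: each value contributes (value - its group mean)^2
ssw = sum(sum((x - sum(g) / len(g)) ** 2 for x in g) for g in groups.values())

msb = ssb / (k - 1)                      # df = k - 1 = 2
msw = ssw / (n - k)                      # df = n - k = 22
f = msb / msw

print(f"SSB = {ssb:.2f}, SSW = {ssw:.2f}, F = {f:.2f}")
# → SSB = 34.74, SSW = 59.26, F = 6.45
```

These are exactly the treatment and Error sums of squares in the ANOVA output for these data.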
An even smaller example
Suppose we have three groups
• Group 1: 5.3, 6.0, 6.7
• Group 2: 5.5, 6.2, 6.4, 5.7
• Group 3: 7.5, 7.2, 7.9
We get the following statistics:
SUMMARY
Groups     Count  Sum   Average   Variance
Column 1   3      18.0  6.0       0.49
Column 2   4      23.8  5.95      0.176667
Column 3   3      22.6  7.533333  0.123333
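The summary statistics for the three small groups can be reproduced with the stdlib statistics module (mean, and sample variance with an n − 1 denominator):

```python
import statistics

# The three small groups from the example above.
groups = [
    [5.3, 6.0, 6.7],        # Group 1
    [5.5, 6.2, 6.4, 5.7],   # Group 2
    [7.5, 7.2, 7.9],        # Group 3
]

for i, g in enumerate(groups, start=1):
    print(f"Group {i}: count={len(g)}, sum={sum(g):.1f}, "
          f"average={statistics.mean(g):.6g}, "
          f"variance={statistics.variance(g):.6g}")
```

Note that statistics.variance is the sample variance (divides by n − 1), which is what the summary table reports.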
ANOVA Output
Analysis of Variance for days
Source DF SS MS F P
treatment 2 34.74 17.37 6.45 0.006
Error 22 59.26 2.69
Total 24 94.00
• treatment DF: 1 less than the # of groups
• Error DF: # of data values − # of groups (equals the df for each group added together)
• Total DF: 1 less than the # of individuals (just like other situations)
ANOVA Output
Analysis of Variance for days
Source     DF  SS     MS     F     P
treatment 2 34.74 17.37 6.45 0.006
Error 22 59.26 2.69
Total 24 94.00
MSB = SSB / DFB
MSW = SSW / DFW
F = MSB / MSW
The P-value comes from the F(DFB, DFW) distribution.
(P-values for the F statistic are in Table E)
So How big is F?
Since F is
Mean Square Between / Mean Square Within
= MSB / MSW
A large value of F indicates relatively more difference
between groups than within groups (evidence
against H0)
To get the P-value, we compare to F(I-1,n-I)-distribution
• I-1 degrees of freedom in numerator (# groups -1)
• n - I degrees of freedom in denominator (rest of df)
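One way to see what this P-value means, without F tables, is a permutation check: shuffle the treatment labels many times and ask how often a shuffled data set produces an F at least as large as the observed 6.45. This Monte Carlo sketch is an approximation, not the table-based P-value, but it should land near the reported 0.006:

```python
import random

# Blister data from the earlier example.
groups = [
    [5, 6, 6, 7, 7, 8, 9, 10],            # A
    [7, 7, 8, 9, 9, 10, 10, 11],          # B
    [7, 9, 9, 10, 10, 10, 11, 12, 13],    # P
]

def f_stat(gs):
    """One-way ANOVA F ratio: MSB / MSW."""
    n = sum(len(g) for g in gs)
    k = len(gs)
    grand = sum(sum(g) for g in gs) / n
    ssb = sum(len(g) * (sum(g) / len(g) - grand) ** 2 for g in gs)
    ssw = sum(sum((x - sum(g) / len(g)) ** 2 for x in g) for g in gs)
    return (ssb / (k - 1)) / (ssw / (n - k))

observed = f_stat(groups)                 # ≈ 6.45

random.seed(1)
pooled = [x for g in groups for x in g]
sizes = [len(g) for g in groups]
reps, extreme = 5000, 0
for _ in range(reps):
    random.shuffle(pooled)                # break any real group effect
    shuffled, start = [], 0
    for size in sizes:
        shuffled.append(pooled[start:start + size])
        start += size
    if f_stat(shuffled) >= observed:      # F this large by chance alone?
        extreme += 1

print(f"F = {observed:.2f}, permutation P ≈ {extreme / reps:.4f}")
```

Shuffled labels rarely produce an F this large, which is the permutation-based version of "a large F is evidence against H0."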
Where’s the Difference?
Once ANOVA indicates that the groups do not all
appear to have the same means, what do we do?
Analysis of Variance for days
Source DF SS MS F P
treatment 2 34.74 17.37 6.45 0.006
Error 22 59.26 2.69
Total 24 94.00
Individual 95% CIs For Mean
Based on Pooled StDev
Level N Mean StDev ----------+---------+---------+------
A 8 7.250 1.669 (-------*-------)
B 8 8.875 1.458 (-------*-------)
P 9 10.111 1.764 (------*-------)
----------+---------+---------+------
Pooled StDev = 1.641 7.5 9.0 10.5
Clearest difference: P is worse than A (CI’s don’t overlap)
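The pooled StDev and the individual 95% CIs can be reproduced by hand: the pooled s is √MSW, and each interval is x̄ᵢ ± t* · s/√nᵢ, where t* ≈ 2.074 is the 0.975 quantile of the t-distribution with 22 (Error) df. A sketch:

```python
import math

# Blister data from the earlier example.
groups = {
    "A": [5, 6, 6, 7, 7, 8, 9, 10],
    "B": [7, 7, 8, 9, 9, 10, 10, 11],
    "P": [7, 9, 9, 10, 10, 10, 11, 12, 13],
}

n = sum(len(g) for g in groups.values())
k = len(groups)

# Pooled SD is the square root of MSW (the Error mean square, 59.26 / 22).
ssw = sum(sum((x - sum(g) / len(g)) ** 2 for x in g) for g in groups.values())
pooled_sd = math.sqrt(ssw / (n - k))      # ≈ 1.641, as in the output

t_star = 2.074                            # t(0.975, df = 22), from tables
for name, g in groups.items():
    mean = sum(g) / len(g)
    half = t_star * pooled_sd / math.sqrt(len(g))
    print(f"{name}: mean = {mean:.3f}, "
          f"95% CI = ({mean - half:.2f}, {mean + half:.2f})")
```

The interval for A tops out below where the interval for P begins, which is the non-overlap the output shows for "P is worse than A."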