ONE-WAY
ANALYSIS OF VARIANCE (ANOVA)
Purpose
To
test the affect that one categorical variable (IV or SV) has on another
continuous variable (DV). A 1-way ANOVA is used to compare three or more
population means, m, using sample means,
.
BACKGROUND
1.
Participants can be randomly chosen from the population or obtained by
some other (more convenient) means (e.g., haphazard/convenient sampling).
2.
Participants are either randomly assigned to 1 level of the independent
variable, such as in an experimental design, or can be categorized based on
membership to some type of subject variable (academic major, for example), such
as in a non-manipulated/correlational design. This type of assignment would be
a between-participants/independent groups design.
·
It is also possible that participants experience all levels of an
independent variable (within-participants/repeated measures design).
THE NULL HYPOTHESIS
The
null hypothesis for ANOVA states that the population means from which the
samples are taken are equal. In other
words:
m1= m2 = m3 =…= mk
The mean of the population, m, refers to the hypothetical
value that would result if all the individuals from the population had been
observed.
The null hypothesis states that there is at least 1 pair of means that are
different. It does not state that all means are different. To determine which
means are different you should conduct a post-hoc analysis.
LOGIC
OF THE TEST
An
ANOVA tests mean differences among 3 or more groups by analyzing the variance
in the data collected. The estimate for
the total amount of variance in the data can be divided into estimates for
between and within groups variance.
|
Estimate of the total
variance in the data |
= |
Estimate of the between
subjects variance |
+ |
Estimate of the within
subjects variance |
·
The estimate of the total variance in the data is the average distance
from each score to the grand mean (X - m).
·
The estimate of the between subjects variance is the average distance
from the group means to the grand mean (
- m). It is the variance that we can explain -- it is good variance.
·
The estimate of within subjects variance is the average distance from
each score to the group mean (X -
). It is the variance that we can't explain -- it is
bad variance.
The
F-value is the ratio of between groups variance to within groups variance. Conceptually, the F ratio can be thought of
using the following formula:
![]()
Ideally
the F-ratio would be large, the proportion of good
variance should be higher than the proportion of bad variance. Each of the
estimates are calculated by dividing the Sum of Squares by the appropriate
degrees of freedom (k -1 & N - k)
THE FORMULA
![]()
;
; dfB
= k – 1
;
;
dfW
= N – k
Where n is the number of participants per group
N is the number of participants in
the entire sample
k is the number
of groups
THE SUMMARY TABLE
As you
complete the calculations it is helpful to complete a summary table. You should
total the SS and df columns.
You should place an asterisk (*) next to the F-ratio if it is statistically
significant.
Source SS df MS F h2
Between
Within
_______________________________
Total
STATING A CONCLUSION
Your
conclusion should contain several pieces of information. First, state if the
alternative hypothesis was supported (if null was rejected) or refuted (if null
was retained). Next state the statistics, both descriptive (i.e., means) and
inferential (i.e., F-ratio, df,
alpha level) using proper APA format. Third, tell if any causal relationship
exists (only when null is rejected and experimental design used). Lastly,
determine the effect sizes and discuss the strength of the relationship between
the variables. Example:
The hypothesis that type of
beverage consumed was related to perceived attractiveness was supported. The
omnibus F test of perceived attractiveness ratings for participants who
consumed non-alcoholic beer (M = 7), regular beer (M = 8) and
water (M = 4) was significant, F (2, 9) = 6.49, p <
.05. The type of beverage caused
differences in perceived attractiveness because type of beverage was
manipulated. The strength of the relationship between type of beverage and
perceived attractiveness was medium (h2 = .59).