1-WAY ANALYSIS OF VARIANCE (ANOVA)

 

Purpose

 

Compare three or more population means, m, using sample means, , when data is collected in an independent groups/between participants type of design.

 

BACKGROUND

 

1.                  Participants can be randomly chosen from the population, or obtained by some other, more convenient means (haphazard/convenient sampling).

2.                  Participants are either randomly assigned to 1 level of the independent variable, such as in an experiment/manipulated design, or can be categorized based on membership to some type of subject variable (academic major, for example), such as in a non-manipulated/correlational design).    

 

THE NULL HYPOTHESIS

 

The null hypothesis for ANOVA states that the population means from which the samples are taken are equal.  In other words:

 

m1= m2 = m3 =…= mk

 

The mean of the population, m, refers to the hypothetical value that would result if all the individuals from the population had been observed.

 

LOGIC OF THE TEST

 

An ANOVA tests mean differences among 3 or more groups by analyzing the variance in the data collected.  The estimate for the total amount of variance in the data can be divided into estimates for between and within groups variance.

 

Estimate of the total variance in the data

=

Estimate of the between subjects variance

+

Estimate of the within subjects variance

 

 

The F-value is the ratio of between groups variance to within groups variance.  Conceptually, the F ratio can be thought of using the following formula:

 

 

Variance is a measure of how disperse scores are about some central value.  Recall, to estimate the population variance based on scores from a sample, we use the formula:

 

 

DEGREES OF FREEDOM

 

ANOVA estimates two population parameters (between and within groups variance) therefore two types the degrees of freedom are used.

 

The degrees of freedom calculated for the between subjects variance estimate is:

 

k - 1

Where:

k = the number of conditions

 

The degrees of freedom calculated for the within subjects variance estimate is:

 

N - k

Where:

N = the total number of participants in the study

k = the number of conditions

 

The between subjects degrees of freedom is often referred to as the degrees of freedom in the numerator, while the within subjects degrees of freedom is often referred to as the degrees of freedom in the denominator because the estimate for the between subjects variance is in the numerator of the F ratio and the estimate for the within subjects variance is in the denominator of the F ratio.

 

1-WAY ANOVA FORMULA

 

 

Where:

msbetween = the estimated between subjects variance

            mswithin = the estimated within subjects variance

 

THE BETWEEN SUBJECTS VARIANCE ESTIMATE:

 

 

 

THE WITHIN SUBJECTS VARIANCE ESTIMATE:

 

 

 

 

SAMPLING DISTRIBUTION OF F

 

The probability distribution of F-values that would occur if all possible samples of fixed sizes for N and k were drawn from the null population.  Gives two pieces of information:

1.                  All possible values for F of sample size N with groups k

2.                  Probability of getting each values of F if sampling was random from the null hypothesis population.

 

If the null hypothesis was true (i.e., if m1= m2= m3=…= mk) then the F-value should equal 1.  However, due to sampling fluctuation, occasionally one mean will be slightly lower (or higher) than the others.  In rare cases (i.e., on the tail of the sampling distribution) the means are very different, merely due to chance fluctuation.

 

The means may be very different even if the null is true.  It is not likely, but it can occur.

 

STATING A CONCLUSION

 

Your conclusion should contain several pieces of information. First, state if the alternative hypothesis was supported (if null was rejected) or refuted (if null was retained). Next state the statistics, both descriptive (i.e., means) and inferential (i.e., F-ratio, df, alpha level, critical value) using proper APA format. Third, tell if any causal relationship exists (only when null is rejected and experimental design used). Lastly, calculate effect sizes and discuss the strength of the relationship between the variables. Example:

 

The hypothesis that type of beverage consumed was related to perceived attractiveness was supported. The omnibus F test of perceived attractiveness ratings for participants who consumed non-alcoholic beer (M = 7), regular beer (M = 8) and water (M = 4) was significant, F (2, 9) = 6.49, p < .05, given Fcrit = 4.26. The type of beverage caused differences in perceived attractiveness. The strength of the relationship between type of beverage and perceived attractiveness was medium (h2 = .59, w2 = .48).

                                                                                                                                             

 

CALCULATING AN ANOVA FOR INDEPENDENT SAMPLES FROM RAW DATA

 

Step 1: Calculate the msbetween

 

            Step 1a: Calculate SSbetween

            Step 1b: Determine dfbetween

            Step 1c: Divide SSbetween by dfbetween

 

Step 2: Calculate the mswithin

 

            Step 1a: Calculate SSwithin

            Step 1b: Determine dfwithin

            Step 1c: Divide SSwithin by dfwithin

 

Step 3: Calculate the F-ratio

 

Step 4: Create an F-table

 

Source             SS        df         ms        F         

 

Between

 

Within

 

Total

 

Step 5: Evaluate the F-value

 

Step 6: Interpret results using group means