Question 1

What does the F-test for equality of variances test?

Accepted Answer

It tests the null hypothesis H₀: σ₁² = σ₂² against the alternative H₁: σ₁² ≠ σ₂². A significant result (p ≤ α) means the two population variances are statistically different. A non-significant result means the data are consistent with equal variances, but does not prove they are equal.

Question 2

Why is the F-test used before a two-sample t-test?

Accepted Answer

The pooled two-sample t-test assumes that both groups have the same population variance. If this assumption is violated, the test can produce incorrect p-values. Running an F-test first checks this assumption: if the F-test is significant, use the Welch t-test instead, which does not assume equal variances.

Question 3

What are the assumptions of the F-test for equality of variances?

Accepted Answer

Both samples must be drawn from normally distributed populations, and the samples must be independent of each other. The F-test is quite sensitive to non-normality — even moderate departures can distort the p-value. If normality is doubtful, use Levene's test or the Brown–Forsythe test instead.

Question 4

Why is the larger variance always placed in the numerator?

Accepted Answer

Placing the larger variance in the numerator ensures F ≥ 1, which confines the critical region to the upper tail of the F-distribution and avoids the need to consult a lower-tail table. For a two-tailed test, the p-value is then simply 2 × P(F > F_obs), which is straightforward to compute.

Question 5

How do I interpret the critical F-value?

Accepted Answer

The critical F-value (F_crit) is the value that cuts off the top α/2 of the F-distribution. If your calculated F exceeds F_crit, you reject H₀ at significance level α. Using the p-value and using the critical value always lead to the same decision — they are two equivalent ways of summarising the same comparison.

Question 6

When should I use Levene's test instead of the F-test?

Accepted Answer

Levene's test is preferable when your data may not follow a normal distribution, because it is robust to non-normality. The F-test for equality of variances is the optimal test when normality holds, but its Type I error rate can be severely distorted by skewed or heavy-tailed data. In practice, many statisticians use Levene's test by default to avoid this risk.

Input	Result	Context
s₁² = 0.34, n₁ = 25; s₂² = 0.29, n₂ = 25; α = 0.05	F = 1.1724, p ≈ 0.6767 — fail to reject H₀	Two machines produce bolts. Bolt-diameter variances are not significantly different; both machines are equally consistent.
s₁² = 110, n₁ = 41; s₂² = 125, n₂ = 31; α = 0.05	F = 1.1364, p ≈ 0.6679 — fail to reject H₀	Two teaching methods. Test-score variances are statistically equal; both methods produce similar consistency of outcomes.
s₁² = 5.2, n₁ = 100; s₂² = 4.8, n₂ = 100; α = 0.01	F = 1.0833, p ≈ 0.6366 — fail to reject H₀	Two stocks compared for daily-return volatility. At the 1% level there is no evidence of different risk profiles.
s₁² = 18, n₁ = 16; s₂² = 12, n₂ = 16; α = 0.10	F = 1.5, p ≈ 0.3952 — fail to reject H₀	Plant height under two fertilisers. Variance in plant growth is not statistically different at the 10% level.

F-Test for Equality of Two Variances Calculator

Group 1

Group 2

About the F-Test for Equality of Two Variances

F-Test for Equality of Variances — Examples

How to use the F-Test for Equality of Variances Calculator

F-Test for Equality of Variances — FAQ