
Some software packages perform the Levene test, but they will not neces sarily
get the same answer becau se they do not necessarily use absolute deviations from
the mean. For example, MINITAB uses absolute residuals with respect to the
median, an especially good idea in case of skewed data. By default, SAS uses the
squared deviations from the mean, although the absolute deviations from the mean
can be requested. SAS also allows absolute deviations from the median (as the BF
test, because Brown and Forsythe studied this procedure).
The ANOVA F-test is pretty robus t to both the normality and constant
variance assumptions. The test will still work under moderate departures from
these two assumptions. When the sample sizes are all the same, as we are assuming
so far, the test is especially insensitive to unequal variances. Also, there is a
generalization of the two-sample t-test of Section 10.2 for more than two samples,
and it does not demand equal variances. This test is available in JMP, R, and SAS.
If there is a major violation of assumptions, then the situation can sometimes
be corrected by a data transformation, as discussed in Section 11.3. Alternatively,
the bootstrap can be used, by generalizing the method of Section 10.6 from two
groups to several . There is also a nonparametric test (no normality required), as
discussed in Exercise 37 of Chapter 14.
Exercises Section 11.1 (1–10)
1. An experiment to compare I ¼ 5 brands of golf
balls involved using a robotic driver to hit J ¼ 7
balls of each brand. The resulting between-sample
and within-sample estimates of s
2
were MSTr ¼
123.50 and MSE ¼ 22.16, respectively.
a. State and test the relevant hypotheses using a
significance level of .05.
b. What can be said about the P-value of the test?
2. The lumen output was determined for each of I ¼ 3
different brands of 60-watt soft-white lightbulbs,
with J ¼ 8 bulbs of each brand tested. The sums
of squares were computed as SSE ¼ 4773.3 and
SSTr ¼ 591.2. State the hypotheses of interest
(including word definitions of parameters), and
use the F test of ANOVA (a ¼ .05) to decide
whether there are any differences in true average
lumen outputs among the three brands for this type
of bulb by obtaining as much information as possi-
ble about the P-value.
3. In a study to assess the effects of malaria infection on
mosquito hosts (“Plasmodium cynomolgi: Effects of
Malaria Infection on Laboratory Flight Performance
of Anopheles stephensi Mosquitos,” Exp. Parasitol.,
1977: 397–404), mosquitoes were fed on either infec-
tive or noninfective rhesus monkeys. Subsequently
the distance they flew during a 24-h period was
measured using a flight mill. The mosquitoes were
divided into four groups of eight mosquitoes each:
infective rhesus and sporozites present (IRS),
infective rhesus and oocysts present (IRD), infective
rhesus and no infection developed (IRN), and
noninfective (C). The summary data values are
x
1
¼ 4:39 IRSðÞ,
x
2
¼ 4:52 IRDðÞ,
x
3
¼
5:49 IRNðÞ,
x
4
¼ 6:36 CðÞ,
x
¼ 5:19, and
PP
x
2
ij
¼ 911:91. Use the ANOVA F test at level
.05 to decide whether there are any differences
between true average flight times for the four
treatments.
4. Consider the following summary data on the mod-
ulus of elasticity ( 10
6
psi) for lumber of three
different grades (in close agreement with values in
the article “Bending Strength and Stiffness of Sec-
ond-Growth Douglas-Fir Dimension Lumber”
(Forest Products J., 1991: 35–43), except that the
sample sizes there were larger):
Grade J
x
i
s
i
1 10 1.63 .27
2 10 1.56 .24
3 10 1.42 .26
Use this data and a significance level of .01 to test
the null hypothesis of no difference in mean modu-
lus of elasticity for the three grades.
5. The article “Origin of Precambrian Iron Forma-
tions” (Econ. Geol., 1964: 1025–1057) reports the
11.1 Single-Factor ANOVA 563