Devore J.L., Berk K.N. Modern Mathematical Statistics with Applications

Подождите немного. Документ загружается.



5



3



2



4



1

13.1 13.3 13.8 14.3 14.5

Thus brands 1 and 4 are not significantly different from each other, but are

significantly higher than the other three brands in their true aver age amo unts

captured. Brand 2 is significantly better than 3 and 5 but wor se than 1 and 4, and

brands 3 and 5 do not differ significantly.



2

¼ 14:15 rather than 13.8 with the same computed w, then the configu-

ration of underscored means would be



5



2



4



1

13.1 13.3 14.15 14.3 14.5

■

Example 11.5 A biologist wished to study the effects of ethanol on sleep time. A sample of 20 rats,

matched for age and other characteristics, was selected, and each rat was given an

oral injection having a particular concentration of ethanol per kg of body weight.

The rapid eye movement (REM) sleep time for each rat was then recorded for a

24-h period, with the following resu lts:

Treatment

(ethanol) REM time x

i



i

0 (control) 88.6 73.2 91.4 68.0 75.2 396.4 79.28

1 g/kg 63.0 53.9 69.2 50.1 71.5 307.7 61.54

2 g/kg 44.9 59.5 40.2 56.3 38.7 239.6 47.92

4 g/kg 31.0 39.6 45.3 25.2 22.7 163.8 32.76



¼ 1107.5





¼ 55.375

Does the data indi cate that the true average REM sleep time depends on the

concentration of ethanol? (This example is based on an experiment reported in

“Relationship of Ethanol Blood Level to REM and Non-REM Sleep Time and

Distribution in the Rat,” Life Sci., 1978: 839–846.)

The



i

’s differ rather substantially from each other, but there is also a great

deal of variability within each sample, so to answer the question precisely we must

carry out the ANOVA. With

¼ 68;697:6 and correction factor



= IJðÞ¼1107:5ðÞ

=20 ¼ 61; 327:8, the computing formulas yield

SST ¼ 68; 697:6  61 ; 327:8 ¼ 7369 :8

SSTr ¼

396:40

þ 307:70

þ 239:60

þ 163:80



 61; 327:8

¼67; 210:2  61; 327:8 ¼ 5882:4

and

SSE ¼ 7369:8  5882:4 ¼ 1487:4

Table 11.4 is a SAS ANOVA table. The last column gives the P-value, which

is .0001. Actually, the P-value is .0000083, but SAS does not output anything lower

than .0001. It does not output .0000 because this could be misinterpreted to say that

the P-value is 0. Using a significance level of .05, we reject the null hypothesis H

¼ m

, since the given P-value ¼ .0001 < .05 ¼ a. True average

REM sleep time does appear to depend on ethanol concentration.

568 CHAPTER 11 The Analysis of Variance

There are I ¼ 4 treatme nts and 16 df for error, so Q

.05,4,16

¼ 4.05 and

w ¼ 4:05

ﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ

93:0=5

¼ 17:47. Ordering the means and underscoring yields



4



3



2



1

32.76 47.92 61.54 79.28

The interpretation of this underscoring must be done with care , since we seem to

have concluded that treatments 2 and 3 do not differ, 3 and 4 do not differ, yet 2 and

4 do differ. The suggested way of expressing this is to say that although evidence

allows us to conclude that treatments 2 and 4 differ from each other, neither has

been shown to be significantly different from 3. Treatment 1 has a significantly

higher true average REM sleep time than any of the other treatments. This treat-

ment involves 0 ethanol (alcohol) and there is a trend toward less sleep with more

ethanol, although not all differences are significant.

Figure 11.4 shows SAS output from the application of Tukey’s procedure.

The Interpretation of a in Tukey’s Procedure

We stated previously that the simultaneous confidence level is controlled by

Tukey’s method. So what does “simultaneous” mean here? Consider calculating

a 95% CI for a population mean m based on a sample from that population and then

Table 11.4 SAS ANOVA table

Analysis of variance procedure

Dependent Variable: TIME

Sum of Mean

Source DF Squares Square F Value Pr

> F

Model 3 5882.35750 1960.78583 21.09 .0001

Error 16 1487.40000 92.96250

Corrected

Total 19 7369.75750

Alpha = 0.05 df = 16 MSE = 92.9625

Critical Value of Studentized Range = 4.046

Minimum Significant Difference = 17.446

Means with the same letter are not significantly different.

Tukey Grouping Mean N TREATMENT

A 79.280

5 0(control)

B 61.540

5 1 gm/kg

B 47.920

5 2 gm/kg

32.760

5 4 gm/kg

Figure 11.4 Tukey’s method using SAS ■

11.2 Multiple Comparisons in ANOVA 569

a 95% CI for a population proportion p based on another sample selected indepen-

dently of the first one. Prior to obtaining data, the probability that the first interval

will include m is .95, and this is also the probability that the second interval will

include p. Because the two samples are selected independently of each other, the

probability that both intervals will include the values of the respective parameters is

(.95)(.95) ¼ (.95)

 .90. Thus the simultaneous or joint confidence level for the

two intervals is roughly 90%—if pairs of intervals are calculated over and over

again from independent samples, in the long run roughly 90% of the time the first

interval will capture m and the second will include p. Similarly, if three CIs are

calculated based on independent samples, the simultaneous confidence level will

be 100(.95)

%  86%. Clearly, as the number of intervals increases, the simulta-

neous confidence level that all intervals capture their respective parameters will

decrease.

Now suppose that we want to maintain the simultaneous confidence level at

95%. Then for two independent samples, the individual confidence level for each

would have to be 100

ﬃﬃﬃﬃﬃﬃﬃ

:95

%  97:5%. The larger the number of intervals, the

higher the individual confidence level would have to be to maintain the 95%

simultaneous level.

The tricky thing about the Tukey intervals is that they are not based on

independent samples—MSE appears in every one, and various intervals share the

same



i

’s (e.g., in the case I ¼ 4, three different intervals all use



1

). This implies

that there is no straightforward probability argument for ascertaining the simulta-

neous confidence level from the individual confidence levels. Nevertheless, if Q

.05

is used, the simultaneous confidence level is controlled at 95%, whereas using Q

.01

gives a simultaneous 99% level. To obta in a 95% simultaneous level, the individual

level for each interval must be considerably larger than 95%. Said in a slightly

different way, to obtain a 5% experimentwise or family error rate, the individual or

per-comparison error rate for each interval must be considerably smaller than .05.

MINITAB asks the user to specify the family error rate (e.g., 5%) and then includes

on output the individual error rate (see Exercise 16).

Conﬁdence Intervals for Other Parametric Functions

In some situations, a CI is desired for a function of the m

’s more complicated than a

difference m

–m

. Let y ¼ Sc

, where the c

’s are constants. One such function is

ðm

þ m

Þ

ðm

þ m

Þ, which in the context of Example 11.4 measures the

difference between the group consi sting of the first two brands and that of the last

three brands. Because the X

’s are normally distributed with E(X

) ¼ m

and

V(X

) ¼ s

y ¼ S

i

is normally distributed, unbiased for y, and

Vð

yÞ¼Vð

i

Þ¼

VðX

i

Þ¼

Estimating s

by MSE and forming

results in a t variable ð

y  yÞ=

, which can

be manipulated to obtain the following 100(1 – a)% confidence interval for Sc



i

 t

a=2;IðJ1Þ

ﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ

ðMSE

Þ=J

ð11:5Þ

570 CHAPTER 11 The Analysis of Variance

Example 11.6

(Example 11.4

continued)

The parametric function for comparing the first two (store) brand s of oil filter with

the last three (national) brands is y ¼

ðm

þ m

2Þ



ðm

þ m

Þ, from which



þ



þ



þ



With

y ¼



1



2

Þ



3



4



5

Þ¼:583 and MSE ¼ .088, a 95%

interval is

:583  2:021

ﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ

5ð:088Þ=½ð6Þð9Þ

¼ :583  :182 ¼ð:401;:765Þ

■

Notice that in the foregoing example the coefficients c

, ..., c

satisfy



¼ 0. When the coefficients sum to 0, the linear combina-

tion y ¼

is called a contrast among the means, and the analysis is available

in a number of statistical software programs.

Sometimes an experiment is carried out to compare each of several “new”

treatments to a control treatment. In such situations, a multiple comparisons

technique called Dunnett’s method is appropriate.

Exercises Section 11.2 (11–21)

11. An experiment to compare the spreading rates of

five different brands of yellow interior latex

paint available in a particular area used 4 gallons

(J ¼ 4) of each paint. The sample average

spreading rates (ft

/gal) for the five brands were



1

¼ 462:0,



2

¼ 512:8,



3

¼ 437:5,

x

4

¼ 469:3, and x

5

¼ 532:1. The computed

value of F was found to be significant at level

a ¼ .05. With MSE ¼ 272.8, use Tukey’s pro-

cedure to investigate significant differences in

the true average spreading rates between brands.

12. In Exercise 11, suppose



3

¼ 427:5. Now which

true average spreading rates differ significantly

from each other? Be sure to use the method of

underscoring to illustrate your conclusions, and

write a paragraph summarizing your results.

13. Repeat Exercise 12 supposing that



2

¼ 502:8in

addition to



3

¼ 427:5

14. Use Tukey’s procedure on the data in Exercise 3

to identify differences in true average flight

times among the four types of mosquitos.

15. Use Tukey’s procedure on the data of Exercise 5

to identify differences in true average total Fe

among the four types of formations (use MSE

¼ 15.64).

16. Reconsider the axial stiffness data given in Exer-

cise 7. ANOVA output from MINITAB follows:

Analysis of Variance for stiffness

Source DF SS MS F P

length 4 43993 10998 10.48 0.000

Error 30 31475 1049

Total 34 75468

Level N Mean StDev

4 7 333.21 36.59

6 7 368.06 28.57

8 7 375.13 20.83

10 7 407.36 44.51

12 7 437.17 26.00

Pooled StDev

¼ 32.39

Tukey’s pairwise comparisons

Family error rate

¼ 0.0500

Individual error rate

¼ 0.00693

Critical value

¼ 4.10

Intervals for (column level mean) -

(row level mean)

46810

85.0

15.4

92.1 57.3

8.3 43.1

124 89.5 82.4

23.9 10.9 18.0

154.2 119.3 112.2 80.0

a. Use the output (without reference to our F

table) to test the relevant hypotheses.

b. Use the Tukey intervals given in the output to

determine which means differ, and construct

the corresponding underscoring pattern.

11.2 Multiple Comparisons in ANOVA 571

17. Refer to Exercise 4. Compute a 95% t CI for the

contrast y ¼

ðm

þ m

Þm

18. Consider the accompanying data on plant growth

after the application of different types of growth

hormone.

1 13 17 7 14

2 21 13 20 17

Hormone 3 18 15 20 17

4 7111810

5 61115 8

a. Perform an F test at level a ¼ .05.

b. What happens when Tukey’s procedure is

applied?

19. Consider a single-factor ANOVA experiment in

which I ¼ 3, J ¼ 5,



1

¼ 10,



2

¼ 12, and



3

¼ 20. Find a value of SSE for which

f > F

.05,2,12

, so that H

: m

¼ m

rejected, yet when Tukey’s procedure is applied

none of the m

’s differ significantly from each

other.

20. Refer to Exercise 19 and suppose



1

¼ 10,



2

¼ 15, and



3

¼ 20. Can you now find a value

of SSE that produces such a contradiction between

the F test and Tukey’s procedure?

21. The article “The Effect of Enzyme Inducing

Agents on the Survival Times of Rats Exposed to

Lethal Levels of Nitrogen Dioxide” (Toxicol.

Appl. Pharmacol., 1978: 169–174) reports the fol-

lowing data on survival times for rats exposed to

nitrogen dioxide (70 ppm) via different injection

regimens. There were J ¼ 14 rats in each group.

Regimen



i

(min) s

1. Control 166 32

2. 3-Methylcholanthrene 303 53

3. Allylisopropylacetamide 266 54

4. Phenobarbital 212 35

5. Chlorpromazine 202 34

6. p-Aminobenzoic acid 184 31

a. Test the null hypothesis that true average sur-

vival time does not depend on injection regi-

men against the alternative that there is some

dependence on injection regimen using

a ¼ .01.

b. Suppose that 100(1  a)% CIs for k different

parametric functions are computed from the

same ANOVA data set. Then it is easily ver-

ified that the simultaneous confidence level is

at least 100(1  ka)%. Compute CIs with

simultaneous confidence level at least 98%

for the contrasts m

–

ðm

þ m

Þand

ðm

þ m

Þm

11.3

More on Single-Factor ANOVA

In this section, we briefly consider some additional issues relating to single-factor

ANOVA. These include an alternative description of the model parameters, b for

the F test, the relationship of the test to procedures previous ly considered, data

transformation, a random effects model, and formulas for the case of unequal

sample sizes.

An Alternative Description of the ANOVA Model

The assumptions of single-factor ANOVA can be described succinctly by means of

the “model equation”

¼ m

þ e

where e

represents a random deviation from the population or true treatment mean

. The e

’s are assumed to be independent, normally distributed rv’s (implying that

the X

’s are also) with E(e

) ¼ 0 [so that E(X

) ¼ m

] and V(e

) ¼ s

[from which

V(X

) ¼ s

for every i and j]. An alternative description of single-factor ANOVA

will give added insight and suggest appropriate generalizations to models involving

more than one factor. Define a parameter m by

572 CHAPTER 11 The Analysis of Variance

m ¼

i¼1

and the parameters a

, ..., a

¼ m

 m ði ¼ 1; ...; IÞ

Then the treatment mean m

can be written as m + a

, where m represents the true

average overall response in the experiment, and a

is the effect, measured as a

departure from m, due to the ith treatment. Whereas we initially had I parameters,

we now have I+1(m, a

, ..., a

). However, because

¼ 0 (the average

departure from the overall mean response is zero), only I of these new parameters

are independently determined, so there are as many independent parameters as

there were before. In terms of m and the a

’s, the model becomes

¼ m

þ a

þ e

ði ¼ 1; ...; I; j ¼ 1; ...; JÞ

In the next two sections, we will develop analogous models for two-factor ANOVA.

The claim that the m

’s are identical is equivalent to the equality of the a

’s, and

because

¼ 0, the null hypothesis becomes

: a

¼ a

¼¼a

¼ 0

In Section 11.1, it was stated that MSTr is an unbiased estimator of s

when

is true but otherwise tends to overestimate s

. More precisely,

EðMSTrÞ¼s

I  1

When H

is true,

¼ 0soE(MSTr) ¼ s

(MSE is unbiased whether or not H

is true). If

is used as a measure of the extent to which H

is false, then a larger

value of

will result in a greater tendency for MSTr to overestimate s

. More

generally, formulas for expected mean squares for multifactor models are used to

suggest how to form F ratios to test various hypotheses .

Proof of the Formula for E(MSTr) For any rv Y, E(Y

) ¼ V(Y) + [E(Y)]

, so

EðSSTrÞ¼E

i





EðX

i

Þ

EðX



VðX

i

Þþ½EðX

i

Þ



VðX



Þþ½EðX



Þ

þ½Jðm þ a

Þ



IJs

þðIJmÞ

¼ Is

þ IJm

þ 2mJ

þ J

 s

 IJm

¼ðI  1Þs

þ J

ðsince

¼ 0Þ

The result then follows from the relationship MSTr ¼ SSTr/(I–1). ■

11.3 More on Single-Factor ANOVA 573

b for the F Test

Consider a set of parameter values a

, a

, ..., a

for which H

is not true. The

probability of a type II error, b, is the probability that H

is not rejected when that

set is the set of true values. One might think that b would have to be determined

separately for each different configuration of a

’s. Fortunately, since b for the F test

depends on the a

’s and s

only through

it can be simultaneously

evaluated for many different alternatives . For example,

¼ 4 for each of the

following sets of a

’s for which H

is false, so b is identical for all three alternatives:

1. a

¼1, a

¼ 1, a

¼ 1

2. a

¼

ﬃﬃﬃ

, a

ﬃﬃﬃ

, a

¼ 0, a

¼ 0

3. a

¼

ﬃﬃﬃ

, a

ﬃﬃﬃﬃﬃﬃﬃﬃ

1=3

, a

ﬃﬃﬃﬃﬃﬃﬃﬃ

1=3

, a

ﬃﬃﬃﬃﬃﬃﬃﬃ

1=3

The quantity J

is called the noncentrality parameter for one-way

ANOVA (because when H

is false the test statistic has a noncentral F distribution

with this as one of its parameters), and b is a decreasing function of the value of this

parameter. Thus, for fixed values of s

and J, the null hypothesis is more likely to

be rejected for alternatives far from H

(large

) than for alternatives close to

. For a fixed value of

, b decreases as the sample size J on each treatment

increases, and it increases as the variance s

increases (since greater underlying

variability makes it more difficult to detect any given departure from H

Because hand computation of b and sample size determination for the F tes t

are quite difficult (as in the case of t tests), statisticians have constructed sets of

curves from which b can be obtained. Sets of curves for numerator df n

¼ 3and

¼ 4 are displayed in Figures. 11.5 and 11.6, respectively. After the values of s

and

the a

’s for which b is desired are specified, these are used to compute the value of

f,wheref

¼ (J/I)

. We then enter the appropriate set of curves at the

54321

12 3

.99

.98

.97

.96

.95

.94

.92

.90

.80

.70

.60

.50

.40

.30

.10

Power = 1 − b

f (for a = .01)

f (for a = .05)

a = .05

a = .01

= 3

Figure 11.5 Power curves for the ANOVA

test (n

¼ 3)

(E. S. Pearson and H. O. Hartley, “Charts of the Power Function for Analysis of Variance Tests,

Derived from the Non-central

Distribution,” Biometrika, vol. 38, 1951: 112, by permission

of Biometrika Trustees.)

574 CHAPTER 11 The Analysis of Variance

value of f on the horizontal axis, move up to the curve associated with error df n

and move over to the value of power on the vertical axis. Finally, b ¼ 1 – power.

Example 11.7 The effects of four different heat treatments on yield point (tons/in

) of steel ingots

are to be investigated. A total of eight ingots will be cast using each treatment.

Suppose the true standard deviation of yield point for any of the four treatments is

s ¼ 1. How likely is it that H

will not be rejected at level .05 if three of the treatments

have the same expected yield point and the other treatment has an expected yield

point that is 1 ton/in.

greater than the common value of the other three (i.e., the fourth

yield is on average 1 standard deviation above those for the first three treatments)?

Suppose that m

¼ m

and m

¼ m

+1,m ¼ðSm

Þ=4 ¼ m

. Then

¼ m

 m ¼

, a

¼

, a

¼

, a





þ



þ



and f ¼ 1.22. The degrees of freedom are n

¼ I –1¼ 3 and n

¼ I(J –1)¼ 28,

so interpolating visually between n

¼ 20 and n

¼ 30 gives power  .47 and

b  .53. This b is rather large, so we might decide to increase the value of J.

How many ingots of each type would be required to yield b  .05 for the alterna-

tive under consideration? By trying different values of J, we can verify that J ¼ 24

will meet the requirement, but any smaller J will not.

■

As an alternative to the use of power curves, many statistical packages have a

function that calculates the cumulative area under a noncentral F curve (inputs F

numerator df, denominator df, and f

), and this area is b. In addition, MINITAB 16

.01 .05

123

12345

.99

.98

.97

.96

.95

.94

.92

.90

.80

.70

.60

.50

.40

.30

.10

Power = 1 − b

f (for a = .01)

f (for a = .05)

Figure 11.6 Power curves for the ANOVA

test (n

¼ 4)

(E. S. Pearson and H. O. Hartley, “Charts of the Power Function for Analysis of Variance Tests, Derived

from the Non-central

Distribution,”

Biometrika

, vol. 38, 1951: 112, by permission of Biometrika

Trustees.)

11.3 More on Single-Factor ANOVA

575

does something rather different. The user is asked to specify the maximum differ-

ence between m

’s rather than the individual means. For example, we might wish to

calculate the power of the test with a ¼ .05, s ¼ 1, I ¼ 4, J ¼ 2, m

¼ 100,

¼ 101, m

¼ 102, and m

¼ 106. Then the maximum difference is 106 –

100 ¼ 6. However, the power depends not only on this maximum difference but

on the values of all the m

’s. In this situation MINITAB calculates the smallest

possible value of power subject to m

¼ 100 and m

¼ 106, which occurs when the

two other m’s are both halfway between 100 and 106. This power is .86, so we can

say that the power is at least .86 and b is at most .14 when the two most extreme m’s

are separated by 6. The software will also determine the necessary common sample

size if maximum difference and minimum power are specified. The R package has

a function that allows specification of all I of the means, along with the other

parameters. The function calculates whichever parameter is omitted. For example,

in the above scenario with a ¼ .05, s ¼ 1, I ¼ 4, J ¼ 2, m

¼ 100, m

¼ 101,

¼ 102, and m

¼ 106, the function calculates power ¼ .89.

Relationship of the F Test to the t Test

When the number of populations is just I ¼ 2, the ANOVA F is testing H

: m

¼ m

versus H

: m

6¼ m

. In this case, a two-taile d, two-sample t test can also be used. In

Section 10.2, we mentioned the pooled t test, which requires equal variances, as an

alternative to the two-sample t procedure. With a little algebra, it can be shown that

the single-factor ANOVA F test and the two-tailed pooled t test are equivalent; for

any given data set, the P-values for the two tests will be identical, so the same

conclusion will be reached by either test.

The two-s ample t test is more flexible than the F test when I ¼ 2 for two

reasons. First, it is not based on the assumption that s

¼ s

; second, it can be used

to test H

: m

> m

(an upper-tailed t test) or H

: m

< m

as well as H

: m

6¼ m

.As

mentioned at the end of Section 11.1, there is a generalization of the two-sample

t test for I  3 samples with population variances not necessarily the same.

Single-Factor ANOVA When Sample Sizes Are Unequal

When the sample sizes from each population or treatment are not equal, let J

, J

, ...,

denote the I sample sizes and let n ¼ S

denote the total number of observations.

The accompanying box gives ANOVA formulas and the test procedure.

SST ¼

i¼1

j¼1

ðX

 X



i¼1

j¼1





df ¼ n  1

SSTr ¼

i¼1

j¼1

ðX

i

 X



i¼1

i





df ¼ I  1

SSE ¼

i¼1

j¼1

ðX

 X

i

¼ SST  SSTr df ¼

ðJ

 1Þ¼n  I

576 CHAPTER 11 The Analysis of Variance

Test statistic value:

f ¼

MSTr

MSE

where MSTr ¼

SSTr

I  1

and MSE ¼

SSE

n  I

Rejection region: f  F

a;I1;nI

The correction factor (CF) X



=n is subtracted when computing both SST and SSTr.

These formulas are derived in the same way (see Exercise 28) as the similar

formulas in Section 11.1, except that it is harder here to show that MSTr/MSE

has the F distribution under H

Example 11.8 The article “On the Development of a New Approach for the Determination of

YieldStrengthinMg-BasedAlloys”(Light Metal Age, Oct. 1998: 51–53) pre-

sented the following data on elastic modulus (GPa) obtained by a new ultrasonic

method for specimens of an alloy produced using three different casting

processes.

Process Observations J

i·



i

Permanent molding 45.5 45.3 45.4 44.4 44.6 43.9 44.6 44.0 8 357.7 44.71

Die casting 44.2 43.9 44.7 44.2 44.0 43.8 44.6 43.1 8 352.5 44.06

Plaster molding 46.0 45.9 44.8 46.2 45.1 45.5

6 273.5 45.58

22 983.7

Let m

, m

, and m

denote the true average elastic moduli for the three different

processes under the given circumstances. The relevant hypotheses are H

: m

¼m

versus H

: at least two of the m

’s are different. The test statistic is, of course,

F ¼ MSTr/MSE, based on I–1 ¼ 2 numerator df and n–I¼ 22 – 3 ¼ 19

denominator df. Relevant quantities include

¼ 43; 998:73 CF ¼

983:7

¼ 43; 984:80

SST ¼ 43 ; 998:73  43 ; 984:80 ¼ 13:93

SSTr ¼

357:7

352:5

273:5

 43; 984:80 ¼ 7:93

SSE ¼ 13:93  7:93 ¼ 6:00

The rem aining computations are displayed in the accompanying ANOVA table.

Since F

.001,2,19

¼ 10.16 < 12.56 ¼ f, the P-value is sma ller than .001. Thus the

null hypothesis should be rejected at any reasonable significance level; there is

compelling evid ence for concluding that true average elastic modulus somehow

depends on which casting process is used.

11.3 More on Single-Factor ANOVA 577