
228 CHAPTER 10 / Introduction to Hypothesis Testing
decision (with ). But, as in the lower row of the table, sometimes is really
false: Then if we retain , we make a Type II error (with ), and if we reject ,
we avoid a Type II error and make the correct decision (with ).
In any experiment, the results of your inferential procedure will place you in one of
the columns of Table 10.1. If you reject , then either you’ve made a Type I error, or
you’ve made the correct decision and avoided a Type II error. If you retain , then
either you’ve made a Type II error or you’ve made the correct decision and avoided a
Type I error.
The most serious error is a Type I, concluding that an independent variable works
when really it does not. For example, concluding that new drugs, surgical techniques,
or engineering procedures work when they really do not can cause untold damage. For
this reason, researchers always use a small to minimize the likelihood of these errors.
On the other hand, a Type II error is not as harmful because it is merely failing to iden-
tify an independent variable that works. We have faith that future research will eventu-
ally discover the variable. However, we still prefer to avoid Type II errors, and for that
we need power.
Power
Of the various outcomes back in Table 10.1, the goal of research is to reject when it
is false: We conclude that the pill works, and the truth is that the pill does work. Not
only have we avoided any errors, but we have learned about a relationship in nature.
This ability has a special name: Power is the probability that we will reject when it
is false, correctly concluding that the sample data represent a relationship. In other
words, power is the probability of not making a Type II error, so power equals
Power is important because, after all, why bother to conduct a study if we’re unlikely
to reject the null hypothesis even when there is a relationship present? Therefore,
power is a concern anytime we do not reject because we wonder, “Did we just miss
a relationship?” For example, previously, when we did not find a significant effect of
the pill, maybe the problem was that we lacked power: Maybe we were unlikely to
reject even if the pill really worked.
To avoid this doubt, we strive to maximize the power of a study (maximizing the size
of ). Then we’ll have confidence in our decision if we do ultimately retain null.
Essentially, the idea is to do everything we can to ensure that in case we end up in the
Type II situation where there is a relationship in nature, we—and our statistics—will
not miss the relationship. If we still end up retaining , we know that it’s not for lack
of trying. We’re confident that if the relationship was there, we would have found it,
so it must be that the relationship is not there. Therefore, we are confident in the deci-
sion to retain , and, in statistical lingo, we say that we’re confident we have avoided
a Type II error.
REMEMBER We seek to maximize power so that, if we retain , we are con-
fident we are not making a Type II error.
The time to build in power is when we design a study. We’re talking about being in
the Type II situation here, so it’s a given that the relationship exists in nature. We can’t
do anything to ensure that we’re in this situation (that’s up to nature), but assuming we
are, then the goal is to have significant results. Therefore, we increase power by
increasing the likelihood that our results will be significant. Results are significant if
is larger than , so anything that increases the size of the obtained value relative
to the critical value increases power.
z
crit
z
obt
H
0
H
0
H
0
1 2 
H
0
H
0
1 2 .
H
0
H
0
␣
H
0
H
0
p 5 1 2 
H
0
p 5 H
0
H
0
p 5 1 2 ␣