
32. Exercise 17 of Section 12.2 gave data on x ¼
rainfall volume and y ¼ runoff volume (both in
m
3
). Use the accompanying MINITAB output to
decide whether there is a useful linear relation-
ship between rainfall and runoff, and then calcu-
late a confidence interval for the true average
change in runoff volume associated with a 1-m
3
increase in rainfall volume.
The regression equation is runoff ¼
1.13 + 0.827 rainfall
Predictor Coef Stdev t-ratio P
Constant 1.128 2.368 0.48 0.642
Rainfall 0.82697 0.03652 22.64 0.000
s ¼ 5.240 R-sq ¼ 97.5% R-sq(adj) ¼97.3%
33. Exercise 16 of Section 12.2 included MINITAB
output for a regression of daughter’s height on
the midparent height.
a. Use the output to calculate a confidence inter-
val with a confidence level of 95% for the
slope b
1
of the population regression line, and
interpret the resulting interval.
b. Suppose it had previously been believed that
when midparent height increased by 1 in., the
associated true average change in the daugh-
ter’s height would be at least 1 in. Does the
sample data contradict this belief? State and
test the relevant hypotheses.
34. The invasive diatom species Didymosphenia
Geminata has the potential to inflict substantial
ecological and economic damage in rivers. The
article “Substrate Characteristics Affect Coloni-
zation by the Bloom-Forming Diatom Didymo-
sphenia Geminata”(Aquatic Ecology, 2010:
33–40) described an investigation of coloniza-
tion behavior. One aspect of particular interest
was whether y ¼ colony density was related to
x ¼ rock surface area. The article contained a
scatter plot and summary of a regression analy-
sis. Here is representative data:
x 50 71 55 50 33 58 79
y 152 1929 48 22 2 5 35
x 26 69 44 37 70 20 45 49
y 7 269 38 171 13 43 185 25
a. Fit the simple linear regression model to this
data, and then calculate and interpret the coef-
ficient of determination.
b. Carry out a test of hypotheses to determine
whether there is a useful linear relationship
between density and rock area.
c. The second observation has a very extreme
y value (in the full data set consisting of 72
observations, there were two of these). This
observation may have had a substantial
impact on the fit of the model and subsequent
conclusions. Eliminate it and redo parts (a)
and (b). What do you conclude?
35. How does lateral acceleration—side forces expe-
rienced in turns that are largely under driver
control—affect nausea as perceived by bus pas-
sengers? The article “Motion Sickness in Public
Road Transport: The Effect of Driver, Route,
and Vehicle” (Ergonomics, 1999: 1646–1664)
reported data on x ¼ motion sickness dose (calcu-
lated in accordance with a British standard for
evaluating similar motion at sea) and y ¼ reported
nausea (%). Relevant summary quantities are
n ¼ 17;
X
x
i
¼ 222:1;
X
y
i
¼ 193;
X
x
2
i
¼ 3056:69;
X
x
i
y
i
¼ 2759:6;
X
y
2
i
¼ 2975
Values of dose in the sample ranged from 6.0
to 17.6.
a. Assuming that the simple linear regression
model is valid for relating these two variables
(this is supported by the raw data), calculate
and interpret an estimate of the slope parame-
ter that conveys information about the preci-
sion and reliability of estimation.
b. Does it appear that there is a useful linear
relationship between these two variables?
Answer the question by employing the P-
value approach.
c. Would it be sensible to use the simple linear
regression model as a basis for predicting %
nausea when dose ¼ 5.0? Explain your
reasoning.
d. When MINITAB was used to fit the simple
linear regression model to the raw data, the
observation (6.0, 2.50) was flagged as possi-
bly having a substantial impact on the fit.
Eliminate this observation from the sample
and recalculate the estimate of part (a).
Based on this, does the observation appear
to be exerting an undue influence?
36. Mist (airborne droplets or aerosols) is gen-
erated when metal-removing fluids are
used in machining operations to cool and
652
CHAPTER 12 Regression and Correlation