
Exercises Section 12.2 (13–30)
13. Exercise 4 gave data on x ¼ BOD mass loading
and y ¼ BOD mass removal. Values of relevant
summary quantities are
n ¼ 14
X
x
i
¼ 517
X
y
i
¼ 346
X
x
2
i
¼ 39;095
X
y
i
¼ 17;454
X
x
i
y
i
¼ 25;825
a. Obtain the equation of the least squares line.
b. Predict the value of BOD mass removal for a
single observation made when BOD mass
loading is 35, and calculate the value of the
corresponding residual.
c. Calculate SSE and then a point estimate of s.
d. What proportion of observed variation in
removal can be explained by the approximate
linear relationship between the two variables?
e. The last two x values, 103 and 142, are much
larger than the others. How are the equation of
the least squares line and the value of r
2
affected by deletion of the two corresponding
observations from the sample? Adjust the
given values of the summary quantities, and
use the fact that the new value of SSE is
311.79.
14. The accompanying data on x ¼ current density
(mA/cm
2
) and y ¼ rate of deposition (mm/min)
appeared in the article “Plating of 60/40 Tin/
Lead Solder for Head Termination Metallurgy”
(Plating and Surface Finishing, Jan. 1997:
38–40). Do you agree with the claim by the
article’s author that “a linear relationship was
obtained from the tin–lead rate of deposition as
a function of current density”? Explain your
reasoning.
x 20 40 60 80
y .24 1.20 1.71 2.22
15. Refer to the data given in Exercise 1 on tank
temperature and efficiency ratio.
a. Determine the equation of the estimated
regression line.
b. Calculate a point estimate for true average
efficiency ratio when tank temperature is 182.
c. Calculate the values of the residuals from the
least squares line for the four observations for
which temperature is 182. Why do they not all
have the same sign?
d. What proportion of the observed variation in
efficiency ratio can be attributed to the simple
linear regression relationship between the two
variables?
16. As an alternative to the use of father’s height to
predict son’s height, Galton also used the mid-
parent height, the average of the father’s and
mother’s heights. Here are the heights of 11
female students along with their midparent
heights in inches:
Midparent 66.0 65.5 71.5 68.0 70.0 65.5 67.0
Daughter 64.0 63.0 69.0 69.0 69.0 65.0 63.0
Midparent 70.5 69.5 64.5 67.5
Daughter 68.5 69.0 64.0 67.0
a. Make a scatter plot of daughter’s height
against the midparent height and comment
on the strength of the relationship.
b. Is the daughter’s height completely and
uniquely determined by the midparent
height? Explain.
c. Use the accompanying MINITAB output to
obtain the equation of the least squares line
for predicting daughter height from midparent
height, and then predict the height of a daugh-
ter whose midparent height is 70 in. Would
you feel comfortable using the least squares
line to predict daughter height when midpar-
ent height is 74 in.? Explain.
Predictor Coef SE Coef T P
Constant 1.65 13.36 0.12 0.904
midparent 0.9555 0.1971 4.85 0.001
S ¼ 1.45061 R-Sq ¼ 72.3% R-Sq(adj) ¼69.2%
Analysis of Variance
Source DF SS MS F P
Regression 1 49.471 49.471 23.51 0.001
Residual 9 18.938 2.104
Error
Total 10 68.409
d. What are the values of SSE, SST, and the
coefficient of determination? How well does
the midparent height account for the variation
in daughter height?
e. Notice that for most of the families, the mid-
parent height exceeds the daughter height. Is
this what is meant by regression to the mean?
Explain.
17. The article “Characterization of Highway Runoff
in Austin, Texas, Area” (J. Environ. Engrg.,
1998: 131–137) gave a scatter plot, along with
12.2 Estimating Model Parameters 637