
d. Let m ¼ 3 and n ¼ 3, with y
1
¼ 117,
y
2
¼ 119, y
3
¼ 127, y
4
¼ 129, y
5
¼ 138,
y
6
¼ 139. These are the prices in thousands
for three houses in Brookwood and then three
houses in Pleasant Hills. Apply parts (a), (b),
and (c) to this data set.
96. The constant term is not always needed in the
regression equation. For example, many physical
principles involve proportions, where no con-
stant term is needed. In general, if the dependent
variable should be 0 when the independent vari-
ables are 0, then the constant term is not needed.
Then it is preferable to omit b
0
and use the model
Y ¼ b
1
x
1
þ b
2
x
2
þþb
k
x
k
þ e. Here we
focus on the special case k ¼ 1.
a. Differentiate the appropriate sum of squares
to derive the one normal equation for estimat-
ing b
1
.
b. Express your normal equation in matrix
terms, X
0
Xb ¼ X
0
y, where X consists of one
column with the values of the predictor vari-
able.
c. Apply part (b) to the data of Example 12.32,
using hp for y and just engine size in X.
d. Explain why deletion of the constant term
might be appropriate for the data set in part (c).
e. By fitting a regression model with a constant
term added to the model of part (c), test the
hypothesis that the constant is not needed.
97. Assuming that the analysis of variance table is
available, show how the last three columns of
Figure 12.37 (the columns related to residuals)
can be obtained from the previous columns.
98. Given that the residuals are y
^
y ¼ðI HÞy,
show that CovðY
^
YÞ¼ I HðÞs
2
.
99. Use Equations (12.26) and (12.27) to show that
each of the leverages is between 0 and 1, and
therefore the variances of the predicted values
and residuals are between 0 and s
2
.
100. Consider the special case y ¼ b
0
þ b
1
x þ e,so
k ¼ 1 and X consists of a column of 1’s and a
column of the values x
1
, ..., x
n
of x.
a. Write the normal equations in matrix form,
and solve by inverting X
0
X.[Hint:ifad 6¼ bc,
then
ab
cd
1
¼
1
ad bc
d b
ca
Check your answers against those in Sec-
tion 12.2.]
b. Use the inverse of X
0
X to obtain expressions
for the variances of the coefficients, and
check your answers against the results given
in Sections 12.3 and 12.4 (
^
b
0
is the predicted
value corresponding to x* ¼ 0).
c. Compare the predictions from this model with
the predictions from the model of Exercise 94.
Comparing other aspects of the two models,
discuss similarities and differences. Mention,
in particular, the hat matrix, the predicted
values, and the residuals.
101. Continue Exercise 94.
a. Find the elements of the hat matrix and use
them to obtain the variance of the predicted
values. Noting the result of Exercise 100(c),
compare your result with the expression for
Vð
^
YÞ given in Section 12.4.
b. Using the diagonal elements of H, obtain the
variances of the residuals and compare with
the expression given in Section 12.6
c. Compare the variances of predicted values
for an x that is close to
x and an x that is far
from
x.
d. Compare the variances of residuals for an x
that is close to
x and an x that is far from x .
e. Give intuitive explanations for the results of
parts (c) and (d).
102. Carry out the details of the derivation for the
analysis of variance, Equation (12.20).
103. The measurements here are similar to those in
Example 12.36, except that here the students did
the measurements at home, and the results suf-
fered in accuracy. These are measurements from
a sample of ten students:
Wingspan Foot Height
74 13.0 75
56 8.5 66
65 10.0 69
66 9.5 66
62 9.0 54
69 11.0 72
75 12.0 75
66 9.0 63
66 9.0 66
63 8.5 63
a. Regress wingspan on the other two variables.
Carry out the test of model utility and the tests
for the two individual regression coefficients
of the predictors.
12.8 Regression with Matrices 717