
12.3 Coefficient of Determination 503
where
b
1
the slope of the estimated regression equation y
ˆ
b
0
b
1
x
The sign for the sample correlation coefficient is positive if the estimated regression equa-
tion has a positive slope (b
1
0) and negative if the estimated regression equation has a
negative slope (b
1
0).
For the Armand’s Pizza Parlor example, the value of the coefficient of determination
corresponding to the estimated regression equation 60 5x is .9027. Because the
slope of the estimated regression equation is positive, equation (12.13) shows that the
sample correlation coefficient is With a sample correlation co-
efficient of r
xy
.9501, we would conclude that a strong positive linear association
exists between x and y.
In the case of a linear relationship between two variables, both the coefficient of deter-
mination and the sample correlation coefficient provide measures of the strength of the re-
lationship. The coefficient of determination provides a measure between zero and one,
whereas the sample correlation coefficient provides a measure between 1 and 1. Al-
though the sample correlation coefficient is restricted to a linear relationship between two
variables, the coefficient of determination can be used for nonlinear relationships and for
relationships that have two or more independent variables. Thus, the coefficient of deter-
mination provides a wider range of applicability.
兹
.9027 .9501.
y
ˆ
test
SELF
NOTES AND COMMENTS
1. In developing the least squares estimated re-
gression equation and computing the coefficient
of determination, we made no probabilistic as-
sumptions about the error term , and no statis-
tical tests for significance of the relationship
between x and y were conducted. Larger values
of r
2
imply that the least squares line provides a
better fit to the data; that is, the observations are
more closely grouped about the least squares
line. But, using only r
2
, we can draw no conclu-
sion about whether the relationship between x
and y is statistically significant. Such a conclu-
sion must be based on considerations that in-
volve the sample size and the properties of the
appropriate sampling distributions of the least
squares estimators.
2. As a practical matter, for typical data found in
the social sciences, values of r
2
as low as .25 are
often considered useful. For data in the physical
and life sciences, r
2
values of .60 or greater are
often found; in fact, in some cases, r
2
values
greater than .90 can be found. In business appli-
cations, r
2
values vary greatly, depending on the
unique characteristics of each application.
Exercises
Methods
15. The data from exercise 1 follow.
x
i
12345
y
i
3751114
The estimated regression equation for these data is .20 2.60x.
a. Compute
SSE, SST, and SSR using equations (12.8), (12.9), and (12.10).
b. Compute the coefficient of determination r
2
. Comment on the goodness of fit.
c. Compute the sample correlation coefficient.
y
ˆ
CH012.qxd 8/16/10 6:58 PM Page 503
Copyright 2010 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s).
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it.