
618 Chapter 15
0 1 2 3 4 5 6 7 8
0
10
20
30
40
50
60
A line of best fit? And
you just guess what the line is
based on what looks good to
you? That’s hardly scientific.
Drawing the line in this way is just a best guess.
The trouble with drawing a line in this way is that it’s an estimate, so
any predictions you make on the basis of it can be suspect. You have
no precise way of measuring whether it’s really the best fitting line. It’s
subjective, and the quality of the line’s fit depends on your judgment.
Predict values with a line of best fit
So far you’ve seen how scatter diagrams can help you see whether there’s a
correlation between values, by showing you if there’s some sort of pattern.
But how can you use this to predict concert attendance, based on the
predicted amount of sunshine? How would you use your existing scatter
diagram to predict the concert attendance if you know how many hours of
sunshine are expected for the day?
One way of doing this is to draw a straight line through the points on the
scatter diagram, making it fit the points as closely as possible. You won’t be
able to get the straight line to go through every point, but if there’s a linear
correlation, you should be able to make sure every point is reasonably close
to the line you draw. Doing this means that you can read off an estimate
for the concert attendance based on the predicted amount of sunshine.
Here’s your original
scatter diagram.
Here’s the line. It goes straight
through the heart of where the
data points are.
sunshine (hours)
attendance (100’s)
You can use the line to estimate
concert attendance for a certain
number of hours predicted sunshine.
The line that best fits the data points is called the line of best fit.
line of best fit