
608 Chapter 15
Exploring types of data
Up until now, the sort of data we’ve been dealing with has been univariate.
Univariate data concerns the frequency or probability of a single variable. As an
example, univariate data could describe the winnings at a casino or the weights of
brides in Statsville. In each case, just one thing is being described.
What univariate data can’t do is show you connections between sets of data. For
example, if you had univariate data describing the attendance figures at an open air
concert, it wouldn’t tell you anything about the predicted hours of sunshine on that
day. It would just give you figures for concert attendance.
So what if we do need to know what the connection is between variables? While
univariate data can’t give us this information, there’s another type of data that
can—bivariate data.
Open Air Concert Attendance
Attendance
Frequency
Univariate data for concert
attendance tells you nothing
about the hours of sunshine.
All about bivariate data
Bivariate data gives you the value of two variables for each observation, not
just one. As an example, it can give you both the predicted hours of sunshine and
the concert attendance for a single event or observation, like this.
Sunshine (hours)
1.9 2.5 3.2 3.8 4.7 5.5 5.9 7.2
Concert attendance (100’s)
22 33 30 42 38 49 42 55
Bivariate data gives you the
value of two variables for each
observation.
If one of the variables has been controlled in some way or is used to explain the
other, it is called the independent or explanatory variable. The other variable
is called the dependent or response variable. In our example, we want to use
sunshine to predict attendance, so sunshine is the independent variable, and
attendance is the dependent.
introducing bivariate data