
301
Chapter 15: Correlation: The Rise and Fall of Relationships
To use this function, the steps are:
1. Type the data into cell arrays and select a cell for CORREL’s answer.
I’ve entered the SAT data into C3:C22 and the GPA data into D3:D22, and
selected F15. I’ve defined SAT as the name of C3:C22 and GPA as the
name of D3:D22. (Reread Chapter 2 to see how to do this.)
2. From the Statistical Functions menu, select CORREL to open its
Function Arguments dialog box.
3. In the Function Arguments dialog box, enter the appropriate values
for the arguments.
In the Array 1 box, I entered SAT — the name I assigned to the cell range
(C3:C22) that holds the scores for one of the variables.
In the Array 2 box, I entered GPA — the name I assigned the cell range
(D3:D22) that holds the scores for the other variable.
With values entered for each argument, the answer, 0.81662505, appears
in the dialog box.
4. Click OK to put the answer into the selected cell.
Selecting PEARSON instead of CORREL gives you exactly the same
answer, and you use it exactly the same way.
Item analysis: A useful application of correlation
Instructors often want to know how perfor-
mance on a particular exam question is related
to overall performance on the exam. Ideally,
someone who knows the material answers
the question correctly; someone who doesn’t
answers it incorrectly. If everyone answers it
correctly — or if no one does — it’s a useless
question. This evaluation is called item analysis.
Suppose it’s possible to answer the exam
question either correctly or incorrectly, and
it’s possible to score from 0 to 100 on the exam.
Arbitrarily, you can assign a score of 0 for an
incorrect answer to the question, and 1 for a
correct answer, and then calculate a correla-
tion coefficient where each pair of scores is
either 0 or 1 for the question and a number from
0 to 100 for the exam. The score on the exam
question is called a dichotomous variable, and
this type of correlation is called point biserial
correlation.
If the point biserial correlation is high for an
exam question, it’s a good idea to retain that
question. If the correlation is low, the question
probably serves no purpose.
Because one of the variables can only be 0 or
1, the formula for the biserial correlation coef-
ficient is a bit different from the formula for the
regular correlation coefficient. If you use Excel
for the calculations, however, that doesn’t
matter. Just use CORREL (or PEARSON) in the
way I outline.
21 454060-ch15.indd 30121 454060-ch15.indd 301 4/21/09 7:34:49 PM4/21/09 7:34:49 PM