CHAPTER 17
✦
Discrete Choice
747
we had earlier. The other three terms in the log-likelihood are derived similarly, which
produces (Maddala’s results with some sign changes):
P(y = 1, W = 1) = (x
2
β
2
+ γ,x
1
β
1
,ρ),
P(y = 1, W = 0) = (x
2
β
2
, −x
1
β
1
, −ρ),
P(y = 0, W = 1) = [−(x
2
β
2
+ γ ), x
1
β
1
, −ρ),
P(y = 0, W = 0) = (−x
2
β
2
, −x
1
β
1
,ρ).
These terms are exactly those of (17-48) that we obtain just by carrying W in the
second equation with no special attention to its endogenous nature. We can ignore the
simultaneity in this model and we cannot in the linear regression model because, in this
instance, we are maximizing the log-likelihood, whereas in the linear regression case,
we are manipulating certain sample moments that do not converge to the necessary
population parameters in the presence of simultaneity.
Example 17.21 Gender Economics Courses at Liberal Arts Colleges
Burnett (1997) proposed the following bivariate probit model for the presence of a gender
economics course in the curriculum of a liberal arts college:
Prob[G = 1, W = 1 |x
G
, x
W
] =
2
(x
G
β
G
+ γ W, x
W
β
W
, ρ).
The dependent variables in the model are
G = presence of a gender economics course
W = presence of a women’s studies program on the campus.
The independent variables in the model are
z
1
= constant term,
z
2
= academic reputation of the college, coded 1 (best), 2, ... to 141,
z
3
= size of the full-time economics faculty, a count,
z
4
= percentage of the economics faculty that are women, proportion (0 to 1),
z
5
= religious affiliation of the college, 0 = no, 1 = yes,
z
6
= percentage of the college faculty that are women, proportion (0 to 1),
z
7
–z
10
= regional dummy variables, South, Midwest, Northeast, West.
The regressor vectors are
x
G
= z
1
, z
2
, z
3
, z
4
, z
5
(gender economics course equation),
x
W
= z
2
, z
5
, z
6
, z
7
− z
10
(women’s studies program equation).
Maximum likelihood estimates of the parameters of Burnett’s model were computed by
Greene (1998) using her sample of 132 liberal arts colleges; 31 of the schools offer gender
economics, 58 have women’s studies, and 29 have both. (See Appendix Table F17.1.) The
estimated parameters are given in Table 17.17. Both bivariate probit and the single-equation
estimates are given. The estimate of ρ is only 0.1359, with a standard error of 1.2359. The Wald
statistic for the test of the hypothesis that ρ equals zero is ( 0.1359/1.2539)
2
= 0.011753.
For a single restriction, the critical value from the chi-squared table is 3.84, so the hypothesis
cannot be rejected. The likelihood ratio statistic for the same hypothesis is 2[−85.6317 −
(−85.6458) ] = 0.0282, which leads to the same conclusion. The Lagrange multiplier statistic
is 0.003807, which is consistent. This result might seem counterintuitive, given the setting.
Surely “gender economics” and “women’s studies” are highly correlated, but this finding does
not contradict that proposition. The correlation coefficient measures the correlation between
the disturbances in the equations, the omitted factors. That is, ρ measures (roughly) the
correlation between the outcomes after the influence of the included factors is accounted