
CHAPTER 18
✦
Discrete Choices and Event Counts
831
3. Consider estimation of a Poisson regression model for y
i
|x
i
. The data are truncated
on the left—these are on-site observations at a recreasion site, so zeros do not
appear in the data set. The data are censored on the right—any response greater
than 5 is recorded as a 5. Construct the log-likelihood for a data set drawn under
this sampling scheme.
Applications
1. Appendix Table F17.2 provides Fair’s (1978) Redbook Magazine survey on extra-
marital affairs. The variables in the data set are as follows:
id = an identification number
C = constant, value = 1
yrb = a constructed measure of time spent in extramarital affairs
v
1
= a rating of the marriage, coded 1 to 5
v
2
= age, in years, aggregated
v
3
= number of years married
v
4
= number of children, top coded at 5
v
5
= religiosity, 1 to 4, 1 = not, 4 = very
v
6
= education, coded 9, 12, 14, 16, 17, 20
v
7
= occupation
v
8
= husband’s occupation
and three other variables that are not used. The sample contains a survey of 6,366
married women. For this exercise,we will analyze, first, the binary variable A = 1if
yr b > 0,0 otherwise. The regressors of interest are v
1
to v
8
; however, not necessarily
all of them belong in your model. Use these data to build a binary choice model for
A. Report all computed results for the model. Compute the partial effects for the
variables you choose. Compare the results you obtain for a probit model to those
for a logit model. Are there any substantial differences in the results for the two
models?
2. Continuing the analysis of the first application, we now consider the self-reported
rating, v
1
. This is a natural candidate for an ordered choice model, because the
simple five-item coding is a censored version of what would be a continuous scale on
some subjective satisfaction variable. Analyze this variable using an ordered probit
model. What variables appear to explain the response to this survey question? (Note:
The variable is coded 1, 2, 3, 4, 5. Some programs accept data for ordered choice
modeling in this form, for example, Stata, while others require the variable to be
coded 0, 1, 2, 3, 4, for example, LIMDEP. Be sure to determine which is appropriate
for the program you are using and transform the data if necessary.) Can you obtain
the partial effects for your model? Report them as well. What do they suggest about
the impact of the different independent variables on the reported ratings?
3. Several applications in the preceding chapters using the German health care data
have examined the variable DocVis, the reported number of visits to the doctor.
The data are described in Appendix Table F7.1. A second count variable in that
data set that we have not examined is HospVis, the number of visits to hospital. For
this application, we will examine this variable. To begin, we treat the full sample
(27,326) observations as a cross section.