
CHAPTER 18
✦
Discrete Choices and Event Counts
821
Example 18.10 Panel Data Models for Doctor Visits
The German health care panel data set contains 7,293 individuals with group sizes ranging
from 1 to 7. Table 18.17 presents the fixed and random effects estimates of the equation
for DocVis. The pooled estimates are also shown for comparison. Overall, the panel data
treatments bring large changes in the estimates compared to the pooled estimates. There
is also a considerable amount of variation across the specifications. With respect to the
parameter of interest, Public, we find that the size of the coefficient falls substantially with
all panel data treatments. Whether using the pooled, fixed, or random effects specifications,
the test statistics (Wald, LR) all reject the Poisson model in favor of the negative binomial.
Similarly, either common effects specification is preferred to the pooled estimator. There is no
simple basis for choosing between the fixed and random effects models, and we have further
blurred the distinction by suggesting two formulations of each of them. We do note that the
two random effects estimators are producing similar results, which one might hope for. But,
the two fixed effects estimators are producing very different estimates. The NB1 estimates
include two coefficients, Income and Education, which are positive, but negative in every
other case. Moreover, the coefficient on Public, which is large and significant throughout the
table, has become small and less significant with the fixed effects estimators.
We also fit a three-class latent class model for these data. (See Section 14.10.) The
three class probabilities were modeled as functions of Married and Female, which appear
from the results to be significant determinants of the class sorting. The average prior prob-
abilities for the three classes are 0.09212, 0.49361, and 0.41427. The coefficients on Public
in the three classes, with associated t ratios are 0.3388 (11.541), 0.1907 (3.987), and 0.1084
(4.282). The qualitative result concerning evidence of moral hazard suggested at the outset of
Example 18.7 appears to be supported in a variety of specifications (with FE-NB1 the sole
exception).
18.4.8 TWO-PART MODELS: ZERO INFLATION AND HURDLE
MODELS
Mullahy (1986), Heilbron (1989), Lambert (1992), Johnson and Kotz (1993), and Greene
(1994) have analyzed an extension of the hurdle model in which the zero outcome can
arise from one of two regimes.
15
In one regime, the outcome is always zero. In the other,
the usual Poisson process is at work, which can produce the zero outcome or some
other. In Lambert’s application, she analyzes the number of defective items produced
by a manufacturing process in a given time interval. If the process is under control, then
the outcome is always zero (by definition). If it is not under control, then the number
of defective items is distributed as Poisson and may be zero or positive in any period.
The model at work is therefore
Prob(y
i
= 0|x
i
) = Prob(regime 1) + Prob(y
i
= 0|x
i
, regime 2)Prob(regime 2),
Prob(y
i
= j|x
i
) = Prob(y
i
= j|x
i
, regime 2)Prob(regime 2), j = 1, 2,....
Let z denote a binary indicator of regime 1(z = 0) or regime 2 (z = 1), and let y
∗
denote
the outcome of the Poisson process in regime 2. Then the observed y is z×y
∗
. A natural
extension of the splitting model is to allow zto be determined by a set of covariates. These
covariates need not be the same as those that determine the conditional probabilities
in the Poisson process. Thus, the model is
Prob(z
i
= 0 |w
i
) = F(w
i
, γ ), (Regime 1 : y will equal zero.)
Prob(y
i
= j |x
i
, z
i
= 1) =
exp(−λ
i
)λ
j
i
j!
.(Regime 2 : y will be a count outcome.)
15
The model is variously labeled the “with zeros,” or WZ, model [Mullahy (1986)], the zero inflated Poisson,
or ZIP, model [Lambert (1992)], and “zero-altered poisson,” or ZAP, model [Greene (1994)]