
728
PART IV
✦
Cross Sections, Panel Data, and Microeconometrics
TABLE 17.11
Estimated Random Effects Models
Constant Age Income Kids Education Married
Random 0.03411 0.02014 −0.00318 −0.15379 −0.03369 0.01633
Effects (0.09635) (0.00132) (0.06667) (0.02704) (0.00629) (0.03135)
Augmented 0.37485 0.05035 −0.03057 −0.04202 −0.05449 −0.02645
Model (0.10501) (0.00357) (0.09318) (0.03751) (0.03307) (0.05180)
−0.03659 −0.35065 −0.22509 0.02387 0.14668
Means (0.00384) (0.13984) (0.05499) (0.03374) (0.06607)
Example 17.13 Panel Data Random Effects Estimators
Example 17.11 presents several estimators of panel data estimators for the probit and logit
models. Pooled, random effects, and fixed effects estimates are given for the probit model
Prob(DocVis
it
> 0) = ( β
1
+ β
2
Age
it
+ β
3
Income
it
+ β
4
Kids
it
+β
5
Education
it
+ β
6
Married
it
).
We continue that analysis here by considering Mundlak’s approach to the common effects
model. Table 17.11 presents the random effects model from earlier, and the augmented
estimator that contains the group means of the variables, all of which are time varying.
The addition of the group means to the regression brings large changes to the estimates
of the parameters, which might suggest the appropriateness of the fixed effects model. A
formal test is carried by computing a Wald statistic for the null hypothesis that the last five
coefficients in the augmented model equal zero. The chi-squared statistic equals 113.282
with five degrees of freedom. The critical value from the chi-squared table for 95 percent
significance is 11.07, so the hypothesis that δ equals zero, that is, the hypothesis of the
random effects model (restrictions), is rejected. The two log likelihoods are −16, 273.96
for the REM and −16, 222.06 for the augmented REM. The LR statistic would be twice the
difference, or 103.8. This produces the same conclusion. The FEM appears to be the preferred
model.
A series of recent studies has sought to maintain the fixed effects specification while
correcting the bias due to the incidental parameters problem. There are two broad
approaches. Hahn and Kuersteiner (2004), Hahn and Newey (2005), and Fernandez-
Val (2009) have developed an approximate, “large T” result for plim(
ˆ
β
FE,MLE
− β)
that produces a direct correction to the estimator, itself. Fernandez-Val (2009) develops
corrections for the estimated constant terms as well. Arellano and Hahn (2006, 2007)
propose a modification of the log-likelihood function with, in turn, different first-order
estimation equations, that produces an approximately unbiased estimator of β.Ina
similar fashion to the second of these approaches, Carro (2007) modifies the first-order
conditions (estimating equations) from the original log-likelihood function, once again
to produce an approximately unbiased estimator of β. (In general, given the overall
approach of using a large T approximation, the payoff to these estimators is to reduce
the bias of the FE,MLE from O(1/T) to O(1/T
2
), which is a considerable reduction.)
These estimators are not yet in widespread use. The received evidence suggests that
in the simple case we are considering here, the incidental parameters problem is a
secondary concern when T reaches say 10 or so. For some modern public use data
sets, such as the BHPS or GSOEP which are beyond their 15th wave, the incidental
parameters problem may not be too severe. However, most of the studies mentioned
above are concerned with dynamic models (see Section 17.4.6), where the problem is
possibly more severe than in the static case. Research in this area is ongoing.