
872
PART IV
✦
Cross Sections, Panel Data, and Microeconometrics
TABLE 19.6
Estimated Duration Models (estimated standard errors
in parentheses)
λ p Median Duration
Exponential 0.02344 (0.00298) 1.00000 (0.00000) 29.571 (3.522)
Weibull 0.02439 (0.00354) 0.92083 (0.11086) 27.543 (3.997)
Loglogistic 0.04153 (0.00707) 1.33148 (0.17201) 24.079 (4.102)
Lognormal 0.04514 (0.00806) 0.77206 (0.08865) 22.152 (3.954)
a covariate, the estimated Weibull model is
−ln λ = 3.7772 − 9.3515 x, p = 1.00288,
(0.1394) (2.973) (0.1217) ,
median strike length = 27.35(3.667) days, E [t] = 39.83 days.
Note that the Weibull model is now almost identical to the exponential model ( p = 1).
Because the hazard conditioned on x is approximately equal to λ
i
, it follows that the hazard
function is increasing in “unexpected” industrial production. A 1 percent increase in x leads
to a 9.35 percent increase in λ, which because p ≈ 1 translates into a 9.35 percent decrease
in the median strike length or about 2.6 days. (Note that M = ln 2/λ.)
The proportional hazard model does not have a constant term. (The baseline hazard is an
individual specific constant.) The estimate of β is −9.0726, with an estimated standard error
of 3.225. This is very similar to the estimate obtained for the Weibull model.
19.5 INCIDENTAL TRUNCATION AND
SAMPLE SELECTION
The topic of sample selection, or incidental truncation, has been the subject of an
enormous recent literature, both theoretical and applied.
24
This analysis combines both
of the previous topics.
Example 19.9 Incidental Truncation
In the high-income survey discussed in Example 19.2, respondents were also included in the
survey if their net worth, not including their homes, was at least $500,000. Suppose that
the survey of incomes was based only on people whose net worth was at least $500,000.
This selection is a form of truncation, but not quite the same as in Section 19.2. This selection
criterion does not necessarily exclude individuals whose incomes at the time might be quite
low. Still, one would expect that, on average, individuals with a high net worth would have a
high income as well. Thus, the average income in this subpopulation would in all likelihood
also be misleading as an indication of the income of the typical American. The data in such
a survey would be nonrandomly selected or incidentally truncated.
Econometric studies of nonrandom sampling have analyzed the deleterious effects
of sample selection on the properties of conventional estimators such as least squares;
have produced a variety of alternative estimation techniques; and, in the process, have
24
A large proportion of the analysis in this framework has been in the area of labor economics. See, for
example, Vella (1998), which is an extensive survey for practitioners. The results, however, have been applied
in many other fields, including, for example, long series of stock market returns by financial economists (“sur-
vivorship bias”) and medical treatment and response in long-term studies by clinical researchers (“attrition
bias”). Some studies that comment on methodological issues are Heckman (1990), Manski (1989, 1990, 1992),
and Newey, Powell, and Walker (1990).