814
PART IV
✦
Cross Sections, Panel Data, and Microeconometrics
The infinite sum is computed by using the complement. Thus,
E[y
i
|x
i
] = λ
i
−
⎡
⎣
∞
j=0
( j − C)P
i, j
−
C−1
j=0
( j − C)P
i, j
⎤
⎦
= λ
i
−
(
λ
i
− C
)
+
C−1
j=0
( j − C)P
i, j
= C −
C−1
j=0
(C − j)P
i, j
.
Example 18.9 Extramarital Affairs
In 1969, the popular magazine Psychology Today published a 101-question survey on sex
and asked its readers to mail in their answers. The results of the survey were discussed in
the July 1970 issue. From the approximately 2,000 replies that were collected in electronic
form (of about 20,000 received), Professor Ray Fair (1978) extracted a sample of 601 ob-
servations on men and women then currently married for the first time and analyzed their
responses to a question about extramarital affairs. Fair’s analysis in this frequently cited study
suggests several interesting econometric questions. [In addition, his 1977 companion paper
in Econometrica on estimation of the tobit model contributed to the development of the EM
algorithm, which was published by and is usually associated with Dempster, Laird, and Rubin
(1977).]
Fair used the tobit model that we discuss in Chapter 19 as a platform The nonexperimental
nature of the data (which can be downloaded from the Internet at http://fairmodel.econ.yale
.edu/rayfair/work.ss.htm and are given in Appendix Table F18.1). provides a laboratory case
that we can use to examine the relationships among the tobit, truncated regression, and
probit models. Although the tobit model seems to be a natural choice for the model for these
data, given the cluster of zeros, the fact that the behavioral outcome variable is a count that
typically takes a small value suggests that the models for counts that we have examined in
this chapter might be yet a better choice. Finally, the preponderance of zeros in the data that
initially motivated the tobit model suggests that even the standard Poisson model, although
an improvement, might still be inadequate. We will pursue that aspect of the data later. In this
example, we will focus on just the censoring issue. Other features of the models and data
are reconsidered in the exercises.
The study was based on 601 observations on the following variables (full details on data
coding are given in the data file and Appendix Table F18.1):
y = number of affairs in the past year, 0, 1, 2, 3, 4–10 coded as 7
“monthly, weekly, or daily,” coded as 12. Sample mean = 1.46
Frequencies = (451, 34, 17, 19, 42, 38),
z
1
= sex = 0 for female, 1 for male. Sample mean = 0.476,
z
2
= age. Sample mean = 32.5,
z
3
= number of years married. Sample mean = 8.18,
z
4
= children, 0 = no, 1 = yes. Sample mean = 0.715,
z
5
= religiousness, 1 = anti, ...,5 = very. Sample mean = 3.12,
z
6
= education, years, 9 = grade school, 12 = high school, ...,20= Ph.D or other Sample
mean = 16.2,
z
7
= occupation, “Hollingshead scale,” 1–7. Sample mean = 4.19,
z
8
= self-rating of marriage, 1 = very unhappy, ...,5 = very happy. Sample mean = 3.93.