
414 16. Microdemography
on N − N
1
, and so on, the estimate is
N ˆp
1
+(N − N
1
)ˆp
2
+ ···+(N − N
1
− N
2
−···−N
m−1
)ˆp
m
N +(N − N
1
)+···+(N − N
1
− N
2
−···−N
m−1
)
,
or, entering the estimates ˆp
1
, ˆp
2
,... from above,
N
1
+ N
2
+ ···+ N
m
N +(N − N
1
)+···+(N − N
1
− N
2
−···−N
m−1
)
, (16.2.1)
supposing that all women are followed to the mth month.
This widely used index, due to Pearl (1939, p. 296), will be referred to
as ˆp
p
. It contains the total number of conceptions during the m months of
observation in its numerator, and its denominator is the number of woman-
months of exposure, if the month of conception is counted into the exposure.
The index is intuitively appealing quite apart from the statistical argument
above. Multiplied by 1200, it gives pregnancies per 100 woman-years of in-
tercourse, and this rate is often calculated and published. Not only does ˆp
p
seem intuitively reasonable, but also, if all women were equally susceptible,
it would be the correct measure, and we would need to go no further in the
search for a measure of fecundity.
A Heterogeneous Population with Fecundity Constant for Each Woman.
However, we know that some women are more fecund than others, and
we seek from the survey a suitable average of their several p values. The
women who are most fecund will tend to become pregnant first, so the
ˆp
1
, ˆp
2
,... for the several months are estimating different quantities. The
estimate for the first month ˆp
1
= N
1
/N , refers to unselected women and
is an unbiased estimate of the mean p. Since those who become pregnant
drop out of observation, no later month refers to unselected women. The
ith month, for any i>1, omits some women selected for their fecundity,
and so the estimate derived from it, N
i
/(N −N
1
−N
2
−···−N
i−1
), must
be an underestimate of the fecundity of the original N women.
The p values differed from month to month in the sample size on which
they were estimated in the model underlying p
p
, and they differed in no
other way. With such homogeneous material the correct way to weight a
number of estimates of the same parameter is by the quantity of informa-
tion contained in each estimate, that is, by the size of sample available in
each month. With heterogeneity among women the pregnancy ratios are
genuinely different in the different months, and to weight by the quantity of
information, that is, the sample size, would be incorrect. To avoid consid-
ering two different problems at once we will now suppose that the sample
is large, so that random variation can be disregarded. What is wanted is a
population average, in which each woman counts once, and hence we must
weight the women of a given fecundity class according to the number of
women in that class in the population. (See Chapter 19 for a more general
discussion of heterogeneity.)