
THE ANALYSIS OF RECURRENT EVENTS 355
Next we consider the information matrix. The derivative of U(θ) when X = x consists of
−f
(θ, x)
t
f
(θ, x)/σ
2
(θ, x) plus a term involving Y − f (θ, x) for which the mean is zero. Its
expected value (conditional on X = x) is therefore the same as above, and under the modeling
assumption the information matrix is the same as the variance of the estimator. Finally, the
model-free estimate of the variance of U(θ)is
1
n
n
i=1
U(x
i
,θ)U(x
i
,θ)
t
=
1
n
n
i=1
σ(θ, x
i
)
−4
f
(θ, x
i
)
t
f
(θ, x
i
)(y
i
− f (θ, x
i
))
2
.
Combining this with the information matrix above gives the robust variance for GLMs.
13.4 The analysis of recurrent events
To further illustrate that we can analyze a misspecified model for our data and still make an
acceptable inference by using the robust variance estimator, we now study the problem with
recurrent events that was introduced in Section 11.6. In that discussion we did not illustrate
the analysis with any examples, because we wanted the robust variance introduced first. We
now have it, so we will remedy that omission in this section. We noted in Section 11.6 that we
cannot readily apply methods developed for single events in individuals to data consisting of
recurrent events within individuals, because it requires a crucial assumption: all events must
be independent, which would not be consistent with an expected patient heterogeneity in, for
example, disease severity. As noted then, the simplest way to handle this problem is to reduce
the data to the total number of events for each individual, ignoring when in time individual
events occur. Instead we assume that there is, for each individual, a mean intensity over
the period, defined by the expected number of events, divided by the observation time. If we
assume that all individuals have the same constant hazard, with events occurring independently
of each other, we could analyze these count data as a Poisson regression problem, adjusting
for observation time. However, neither the assumption of homogeneity between patients, nor
the within-patient independence of events, is likely to hold. This suggests the possibility of
overdispersion, as illustrated in the next example.
Example 13.8 In a one-year study the investigators wanted to assess the effect on the
occurrence of asthma exacerbations when a long-acting brochodilator drug (LABA) was
added to an inhaled corticosteroid (ICS). The particular study had four treatment arms defined
by two doses of ICS, separated by a factor of 4, each of which was studied with and without
concomitant administration of the LABA. We will analyze this in two steps, as discussed
in Chapter 9, and first analyze the number of exacerbations using a Poisson regression
model approach.
In this step we estimate the group means of the exacerbation rates, adjusting for possible
predictive factors for exacerbations. One set of predictors are related to the fact that patients
with more severe asthma are expected to have more exacerbations. Both the FEV
1
as percent-
age of predicted normal
1
and the log of the ICS dose at enrollment were considered to be such
variables. Other covariates included were body mass index and sex, together with the basic
indicator variables for the four treatment groups. With these variables in a Poisson regression
model, the output provides us with the adjusted exacerbations rates shown in Figure 13.3.
1
This is a prediction of what FEV
1
should have been for a healthy individual of the same height, age and sex.