THE IMPACT OF COMPETING RISKS 297
Example 11.2 Suppose we are studying a chronic but fluctuating disease for which there
naturally occur periods of symptom worsening, so-called exacerbations. Our treatment is
expected to reduce the intensity of the occurrence of such exacerbations, and we have a
placebo control. As part of the protocol the patient is allowed to discontinue participation in
the study whenever he wishes, and in doing so he should state his reasons for discontinuation.
One such reason may be a deterioration of the disease under study. Even though such a
withdrawal is not an exacerbation per se, not fulfilling the precise medical criteria for this, it
may well be closely related to it. It may, for example, precede it. If we study the time to the first
exacerbation and censor these withdrawals, we probably will not do a meaningful analysis. It
may be better to redefine the event under study as either exacerbation or discontinuation due
to deterioration of the disease.
If the variable T
1
exists, we have that d(t) = P(T
1
= t|T
1
≥ t). We can only measure
d
∗
(t), which works conditionally on T ≥ t, so if we want to substitute d
∗
(t) for d(t), we
need the cause-specific hazard to stay the same if we remove subjects experiencing competing
events. Essentially that means that our event and the competing events must act independently
of each other.
This, like the ITT analysis we discussed in Section 3.7, is something that is often dif-
ficult for medical researches to accept: ‘why can’t I analyze non-fatal and fatal myocardial
infarctions separately?’ The answer is that even though it may make medical sense to do this,
there is simply not enough data to do a valid statistical analysis. Instead we need to make
a fundamental assumption, that competing risks act independently of the event of interest,
so it is the shortcomings of statistics (or, rather, the amount of information) that hinder the
fulfillment of this wish.
If we can somehow assume independence, we can estimate (t) from data, considering
those individuals with a competing event to be censored, and thereby obtain the Kaplan–Meier
estimate of the CDF for T
1
. It may also be that we want to eliminate only some censoring
reasons. Suppose we are studying the events ‘death from myocardial infarction’ and ‘death
from other causes’, but also that patients in the study may be lost to follow-up. In order to be
able to estimate the CIF for death from myocardial infarction in the presence of other causes
of death, we want to eliminate the problem posed by those who were lost to follow-up and to
study termination. This is done by use of the formula
G(t) =
t
0
F
c
(s−)d(s),
where d(t) is cause-specific and is estimated from the Nelson–Aalen estimator, and F
c
(t)
is estimated by the Kaplan–Meier estimate for total survival. In this way we get an estimate
of the CIF, eliminating the study-specific censoring mechanisms.
One lesson from this discussion is that the Kaplan–Meier estimate relates to an abstract
time-to-event variable, free of any competing events. To describe the results of an analysis
in terms of the Kaplan–Meier estimate may therefore be somewhat artificial in situations
where there are competing risks that cannot be eliminated. In such a situation it may be more
relevant to compute something like the function G
1
(t)/G
c
2
(t). This represents the conditional
probability for the event prior to time t, provided that the none of the competing risks have
yet occurred. This is how the Kaplan–Meier estimate is sometimes (erroneously) interpreted.
The Kaplan–Meier estimate is an estimate of the CDF for the event in the absence of other,
competing, events. The suggested function provides the probability of obtaining an event of