
344 REMARKS ON SOME ESTIMATION METHODS
This second part continues the discussion on the subject-specific approach to dose
response, initiated in Section 10.6, except that this time we will not estimate the model
by reducing it to a population averaged model. Here we will take as our starting point the
likelihood for the problem. The problem with the likelihood approach for these non-linear,
though not for the linear, mixed-effects models is that their likelihood consists of multidi-
mensional integrals that are hard to compute accurately. Methods exist, but they are often too
computer-intensive at present for routine use, so we often need to use various approximations.
We will have a quick look at some of these approximations and demonstrate why we need
to understand how the estimation method works, in order to understand what we are actually
estimating – the population averaged or the subject-specific aspect of the problem.
13.2 Estimating equations and the robust variance estimate
The way we have carried out estimation in statistical models has mostly followed a certain
pattern. We start with a function U(x, θ) of the data x (both outcome variables and explanatory
variables) and an unknown parameter vector θ, such that the equation U(x, θ) = 0 defines
our estimate of θ as a function of x. Important examples of such functions U(x, θ) include
U(x, θ) = x
11
x
22
− θx
12
x
21
and U(x, θ) = x
11
− E
θ
(x
11
) used to estimate the odds ratio in
Section 5.3, the function
G
m
(x, θ)dF
n
(x) −
1
2
associated with the Wilcoxon test, and the
different functions discussed in Chapter 12, modeling survival data. Some of these will be
revisited below. The function U(x, θ), together with the sampling scheme, defines a stochastic
variable which depends on a parameter, which we denote by U(θ). The estimating equation
is then written as U(θ) = 0. As usual the point estimate of θ obtained from a particular set of
data is denoted θ
∗
, whereas the general solution of the equation (which is a function of the
data) is the estimator of θ, is a stochastic variable and is denoted by
ˆ
θ.
The basic requirement on U(θ) is that it is unbiased, which in this case means that it has
mean zero:
E
θ
(U(θ)) =
U(x, θ)dF
θ
(x) = 0.
Here F
θ
(x) is the CDF for the data set, a multidimensional CDF which we do not need to
be explicit about. It is important to note, and will be discussed further below, that it is the
equation that is unbiased; the estimator
ˆ
θ need not be so. Usually the setup is such that we
have a stochastic variable X, representing a unit of data (data from one subject), together
with a function U(x, θ) such that E
θ
(U(X, θ)) = 0, and we have at our disposal a set of n
independent observations x
i
of X from which θ should be estimated. This means that we
estimate θ from the average of the observed values of U(X, θ),
1
n
n
i=1
U(x
i
,θ) = 0.
(Note that when θ is a vector with s components, we need the estimating function to consist
of s independent components in order for this to be solvable.) This is useful, since it allows
us to appeal to large-sample theory when we wish to derive confidence statements about θ.In
fact, we often appeal to the CLT to obtain that
U(θ) ∈ AsN
s
(0,V
θ
(U(θ)).