
9.1 Sample Average Approximation and Importance Sampling in the L -Shaped Method 393
the marginal distributions conditionally, first choosing
ξ
1
with the first marginal,
g
1
(
ξ
1
)=
ξ
2
,...,
ξ
N
g(
ξ
) d
ξ
. Then, sequentially,
ξ
i
is chosen with density, g
i
(
ξ
i
|
ξ
1
,...,
ξ
i−1
) . Remember that in each case, a random sample with density g
i
(
ξ
i
) on
an interval
Ξ
i
of ℜ can be found by choosing from a uniform random sample u
from [0,1] and then taking
ξ
such that G(
ξ
)=u where G(x)=
x
−∞
g
i
(
ξ
i
)d
ξ
i
.
Example 1
Consider Example 1 of Section 8.2 with x
1
= x
2
= x . We consider both the crude
Monte Carlo approach and the importance sampling using the sublinear approxi-
mation for g(
ξ
) . In this case, g(
ξ
) is actually chosen to depend on x as g
x
(
ξ
)
defined by:
g
x
(
ξ
)=
|x −
ξ
1
|+ |x −
ξ
2
|
E
ξ
[|x −ξ
1
|+ |x −ξ
2
|]
· (1.13)
For comparison, we first consider the L -shaped method with ξ
i
chosen by crude
Monte Carlo from the original uniform density on [0, 1] ×[0,1] and by the im-
portance sampling method with distribution g
x
(ξ) in (1.13). The results appear in
Figure 1 for the solution x
s
at each iteration s of the crude Monte Carlo and im-
portance sampling L -shaped method with
ν
= 500 on each L -shaped iteration.
The figures show up to 101 L -shaped iterations, which involve more than 50,000
recourse problem solutions.
In Figure 1, the crude Monte Carlo iteration values x appear as x(crude) while
the importance sampling iterations appear as x(imp) . We also include the optimal
solution x
∗
=
√
2 −1 on the graph. Note that x(imp) is very close to x
∗
from
just over 40 iterations while x(crude) does not appear to approach this accuracy
within 100 iterations. Note that x(imp) begins to deteriorate after 80 iterations as
the accumulation of cuts increases the probability that some cuts are actually above
Q(x) . If each cut is generated independently, this adds a bias to the results since the
expectation of the outer linearization is the expectation of the maximum of a set of
random approximations, which is greater than the maximum of the expectations of
those cuts in an exact procedure. This problem is reduced but not eliminated with
importance sampling. As a remedy, a fixed set of samples can be used to obtain
convergence for that sample set and then checked for convergence using sequential
sampling procedures as discussed in Section 9.5.
The advantage of importance sampling can also be seen in Figure 2, which com-
pares the optimal value Q(x
∗
) with sample values, Q
ν
(x
ν
) , with crude Monte
Carlo denoted as Q (crude) and Q
ν
imp
(x
ν
) with importance sampling denoted as
Q (imp). Note that the crude Monte Carlo values have a much wider variance, in
fact, double the variance of the importance sampling results. Also note that in both
sampling methods, the estimates have a mean close to the optimal value after 40
iterations.