
10.3 Scenario Generation and Distribution Fitting 427
solution converges asymptotically to a solution of the model with the underlying
distribution.
In the following, we assume that an underlying distribution is known, although,
as elsewhere in this book, this can be interpreted in the Bayesian sense that the
underlying distribution represents the prior belief of the decision maker. For the
development here, we assume the structure of the multistage stochastic linear pro-
gram in (3.4.1), although extensions to nonlinear models are straightforward. The
random parameters in period t are ξ
t
=
ξ
t
(
ω
) . A basic sampling method would
be to take K
1
independent and identically distributed draws,
ξ
1
1
,...,
ξ
1
K
1
, from
ξ
1
and then recursively to draw K
t
samples from ξ
t
conditional on
ξ
1
k
1
,...,
ξ
t−1
k
t−1
where 1 ≤k
s
≤
Π
s
i=1
K
i
, s = 1,...,t −1 for each of the K
t−1
=
Π
t−1
i=1
K
i
possible
scenarios in the sampled decision tree through period t −1.When ξ
t
is serially
independent (i.e., the distribution is the same for all realizations of the history pro-
cess at time t −1forallt ), the same ξ
t
samples may be used along any branch of
the tree, but, in stochastic programming, we assume that optimal decisions may be
path-dependent and, therefore, that the exponential increase in the size of the tree is
necessary to capture all possible future actions.
To keep the sizes of decision trees manageable for computation, stochastic pro-
gramming models generally limit the size of the sample tree so that K
t
is relatively
small (and may be decreasing in t ). To help ensure that the solution of the sample
problem suffers as little as possible from small-sample bias, sample scenario gener-
ation in multistage models often aims to ensure that the sample distribution shares
important characteristics, such as moments and quantiles, with the underlying dis-
tribution of ξ .
To see how multistage sampling works in practice, we consider the investment
model from Section 1.2, where instead of the two possible values in each period, we
suppose that the returns ξ
t
are lognormally distributed where logξ
t
∼ N(
μ
,
Σ
) ,a
bivariate normally distributed random vector with mean
μ
=
0.141
0.122
and vari-
ance/covariance matrix
Σ
= 10
−3
6.740 0.291
0.291 0.0784
. This distribution gives the
same mean and variance for each component of ξ
t
as in Section 1.2, but, instead
of being perfectly correlated, the correlation between the stock and bond is 0.4. In
particular, the mean return of each asset i is
¯
ξ
i
= e
μ
i
+
1
2
σ
ii
, written as
¯
ξ
=
1.155
1.130
, (3.1)
and the covariances are E[(ξ(i) −
¯
ξ
(i))(ξ( j) −
¯
ξ
( j))] = e
μ
i
+
μ
j
+
σ
ii
+
σ
jj
2
(e
σ
ij
−1) ,
which we write collectively as the matrix V ,where
V = 10
−3
9.027 0.380
0.380 0.100
, (3.2)