P1: OTA/XYZ P2: ABC
JWST061-07 JWST061-Caers April 6, 2011 13:20 Printer Name: Yet to Come
7.2 PROBABILITY-BASED APPROACHES 109
A data source often provides only partial information about what we are trying to
model. For example, seismic data (Chapter 8) does not provide a measurement of porosity
or permeability, properties important to flow in porous media; instead, it provides mea-
surements that are indicators of the level of porosity. Satellite data in climate modeling
do not provide direct, exact information on temperature, only indicators of temperature
changes. Given these criteria, we will proceed in two steps in order to include these data
in our model of uncertainty:
1 Calibration step: how much information is contained in each data source? Or, what is
the “information content” of each data source?
2 Integration step: how do we combine these various sources of information content
into a single model of uncertainty?
7.2.2 Calibration of Information Content
The amount of information contained in a data source is dependent on many factors, such
as: the measurement configuration, the measurement error, the physics of the measure-
ment, the scale of modeling, and so on.
The first question that should be asked is: how do we model quantitatively the infor-
mation content of a data source. In probabilistic methods a conditional distribution is
used (Chapter 2). Recall that a conditional distribution P(A|B) models the uncertainty of
some target variable A, given some information B. In our case B will be the data source
and A will be what is being modeled. Recall also that if P(A|B) = P(A) then B carries no
information on A. The question now is how is P(A|B) determined?
To determine such conditional probability, more information is needed, more specif-
ically, we need data pairs (a
i
, b
i
), that is, mutual or joint observations of what we are
trying to model and the data source. This means that at some limited set of locations it is
necessary to have observed the true Earth as well as the data source. In many applications,
at the sample locations, it is possible to have information on A as well as B.
For example, from wells, there may be measurements of porosity and from seismic
measurements of seismic impedance (or any other seismic attribute) in 3D (such as shown
in Figure 7.1). This provides pairs of porosity and impedance measurements that can be
used to plot a scatter plot, such as shown in Figure 7.2. In this scatter plot P(A|B) can now
be calculated, where the event “A = (porosity < t)”, for some threshold t, and “B = (s <
impedance < s + s)”, as shown in Figure 7.2. In this way a new function is created:
P(A|B) = (t, s)
Once we have this function, the conditional probability for any t and any s can be
evaluated. This function is a “calibration function” that measures how much information
impedance carries about porosity. There are many other ways to get this function. Physical
approaches such as rock physics may provide this function, or one may opt for statistical
techniques (e.g., regression methods such neural networks) to “lift” this function from a
data set belonging to another field if what occurs in that field is deemed similar.