
Probability 143
5.3 Sample versus Population
Quantitative information about a process or population usually is gathered
through an experiment. From this information, certain characteristics of
the process or population can be estimated. This approach is illustrated
schematically in Figure 5.1. The population refers to the complete col-
lection of all members relevant to a particular issue and the sample to a
subset of that population. Some populations are finite, such as the num-
ber of students in a measurements class. Some populations essentially are
infinite, such as the number of molecules in the earth’s atmosphere. Many
populations are finite but are so large that they are considered infinite, such
as the number of domestic pets in the U.S. The sample is drawn randomly
and independently from the population. Statistics of the sample, such as
its sample mean value, ¯x, and its sample variance, S
2
x
, can be com-
puted. From these statistics, the population’s parameters, which literally
are almost measurements, such as its true mean value, x
0
, and its true
variance, σ
2
, can be estimated using methods of statistical inference.
The term statistic was defined by R.A. Fisher, the renowned statistician,
as the number that is derived from observed measurements and that esti-
mates a parameter of a distribution [1]. Other useful information also can
be obtained using statistics, such as the probability that a future measurand
will have a certain value. The interval within which the true mean and true
variance are contained also can be ascertained assuming a certain level of
confidence and the distribution of all possible values of the population.
The process of sampling implicitly involves random selection. When a
measurement is made during an experiment, a value is selected randomly
from an infinite number of possible values in the measurand population.
That is, the process of selecting the next measurand value does not depend
upon any previously acquired measurand values. The specific value of the
selected measurand is a random variable, which is a real number between
−∞ and +∞ that can be associated with each possible measurand value.
So, the term random refers to the selection process and not to the often-
misinterpreted meaning that the acquired measurand values form a set of
random numbers. If the selection process is not truly random, then erroneous
conclusions about the population may be made from the sample.
5.4 Plotting Statistical Information
Usually the first thing done after an experiment is to plot the data and to
observe its trends. This data typically is a set of measurand values acquired
with respect to time or space. The representation of the variation in a mea-