
Probability 149
Interval Type
Formula
Reference
Equal-probability
K = 1.87(N −1)
0.40
[11], [12], [17]
Equal-probability
K =
√
N
[6], [15]
Equal-width
K = 3.322 log
10
N
[10]
Equal-width
K = 1.15N
1/3
[13], [14]
TABLE 5.1
Formulas for the number of histogram intervals with 95 % confidence.
4. Count the number of occurrences, n
j
(j = 1 to K), in each ∆x interval.
Check that the sum of all the n
j
’s equals N, the total number of data
points.
5. Check that the conditions for n
j
> 5 (if possible) and ∆x ≥ u
x
(defi-
nitely) are met.
6. Plot n
j
vs xm
j
, where xm
j
is discretized as the mid-point value of each
interval.
Instead of examining the distribution of the number of occurrences of
various magnitudes of a signal, the frequency of occurrences can be deter-
mined and plotted. The plot of n
j
/N = f
j
versus xm
j
is known as the
frequency distribution (sometimes called the relative frequency distri-
bution). The area bounded laterally by any two x values in a frequency
distribution equals the frequency of occurrence of that range of x values or
the probability that x will assume values in that range. Also, the sum of all
the f
j
’s equals 1. The frequency distribution often is preferred over the his-
togram because it corresponds to the probabilities of occurrence. Further, as
the sample size becomes large, the sample’s frequency distribution becomes
similar to the distribution of the population’s probabilities, which is called
the probability density function.
The distribution of all of the values of the infinitely large population
is given by its probability density function, p(x). This will be defined in
the next section. Typically p(x) is normalized such that the integral of p(x)
over all x equals unity. This effectively sets the sum of the probabilities of
all the values between −∞ and +∞ to be unity or 100 %. Similar to the
frequency distribution, the area under the portion of the probability density
function over a given measurand range equals the percent probability that
the measurands will have values in that range.
To properly compare a frequency distribution with an assumed proba-
bility density function on the same graph, the frequency distribution first
must be converted into a frequency density distribution. The frequency
density is denoted by f
∗
j
, where f
∗
j
= f
j
/∆x. This is because the probability
density function is related to the frequency distribution by the expression