
7.1
General Concepts and Criteria
Statistical inference is frequently directed toward drawing some type of conclusion
about one or more parameters (population characteristics). To do so requires that
an investigator obtain sample data from each of the populations under study.
Conclusions can then be based on the computed values of various sample quan-
tities. For example, let m (a parameter) denote the average duration of anesthesia
for a short-acting anesthetic. A random sample of n ¼ 10 patients might be
chosen, and the duration for each one determined, resulting in obser ved durations
x
1
, x
2
,...,x
10
. The sample mean duration
x could then be used to draw a conclusion
about the value of m. Similarly, if s
2
is the variance of the duration distribution
(population variance, another parameter), the value of the sample variance s
2
can be
used to infer something about s
2
.
When discussing general concepts and methods of inference, i t is conve-
nient to have a generic symbol for the parameter of interest. We will use the Greek
letter y for this purpose. The objective of point estimation is to select a single
number, based on sample data, that represents a sensible value for y. Suppose,
forexample,thattheparameterofinterestism, the true average lifetime of
batteries of a certain type. A random sample of n ¼ 3 batteries might yield
observed lifetimes (hours) x
1
¼ 5.0, x
2
¼ 6.4, x
3
¼ 5.9. The computed value of
the sample mean lifetime i s
x ¼ 5:77, a nd it is reasonable to regard 5.77 as a very
plausi bl e value o f m, our “best guess” for the value of m based on the available
sample inform ation.
Suppose we want to estimate a parameter of a single population (e.g., m or s)
based on a random sample of size n. Recall from the previous chapter that before
data is available, the sample observations must be considered random variables
(rv’s) X
1
, X
2
,...,X
n
. It follows that any functi on of the X
i
’s—that is, any statistic—
such as the sample mean
X or sample standard deviation S is also a random variable.
The sam e is true if available data consists of more than one sample. For example,
we can represent duration of anesthesia of m patients on anesthetic A and n patients
on anesthetic B by X
1
,...,X
m
and Y
1
,...,Y
n
, respectively. The difference between
the two sample mean durations is
X Y, the natural statistic for making inferences
about m
1
– m
2
, the difference betwee n the population mean durations.
DEFINITION
A point estimate of a parameter y is a single number that can be regarded as a
sensible value for y. A point estimate is obtained by selecting a suitable
statistic and computing its value from the given sample data. The selected
statistic is called the point estim ator of y.
In the battery example just given, the estimator used to obtain the point
estimate of m was
X, and the point estimate of m was 5.77. If the three observed
lifetimes had instead been x
1
¼ 5.6, x
2
¼ 4.5, and x
3
¼ 6.1, use of the estimator X
would have resulted in the estimate
x ¼ð5:6 þ 4:5 þ 6:1Þ=3 ¼ 5:40. The symbol
^
y
(“theta hat”) is customarily used to denote both the estimator of y and the point
332 CHAPTER 7 Point Estimation