
a. Are the scores from the SI group a sample from
an existing population? If so, what is it? If not,
what is the relevant conceptual population?
b. What do you think is the advantage of randomly
dividing the students into the two groups rather
than letting each student choose which group to
join?
c. Why didn’t the investigators put all students in
the treatment group? [Note: The article “Supple-
mental Instruction: An Effective Component of
Student Affairs Programming” J. Coll. Stud.
Dev., 1997: 577–586 discusses the analysis of
data from several SI programs.]
6. The California State University (CSU) system con-
sists of 23 campuses, from San Diego State in the
south to Humboldt State near the Oregon border.
A CSU administrator wishes to make an inference
about the average distance between the hometowns
of students and their campuses. Describe and dis-
cuss several different sampling methods that might
be employed.
7. A certain city divides naturally into ten district
neighborhoods. A real estate appraiser would like
to develop an equation to predict appraised value
from characteristics such as age, size, number of
bathrooms, distance to the nearest school, and
so on. How might she select a sample of single-
family homes that could be used as a basis for this
analysis?
8. The amount of flow through a solenoid valve in an
automobile’s pollution-control system is an impor-
tant characteristic. An experiment was carried out
to study how flow rate depended on three factors:
armature length, spring load, and bobbin depth.
Two different levels (low and high) of each factor
were chosen, and a single observation on flow was
made for each combination of levels.
a. The resulting data set consisted of how many
observations?
b. Does this study involve sampling an existing
population or a conceptual population?
9. In a famous experiment carried out in 1882,
Michelson and Newcomb obtained 66 observations
on the time it took for light to travel between two
locations in Washington, D.C. A few of the mea-
surements (coded in a certain manner) were 31, 23,
32, 36, 22, 26, 27, and 31.
a. Why are these measurements not identical?
b. Does this study involve sampling an existing
population or a conceptual population?
1.2
Pictorial and Tabular Methods
in Descriptive Statistics
There are two general types of methods within descriptive statistics. In this section
we will discuss the first of these types—representing a data set using visual
techniques. In Sections 1.3 and 1.4, we will develop some numerical summary
measures for data sets. Many visual techniques may already be familiar to you:
frequency tables, tally sheets, histograms, pie charts, bar graphs, scatter diagrams,
and the like. Here we focus on a selected few of these techniques that are most
useful and relevant to probability and inferential statistics.
Notation
Some general notation will make it easier to apply our methods and formulas to
a wide variety of practical problems. The number of observations in a single
sample, that is, the sample size, will often be denoted by n, so that n ¼ 4 for
the sample of universities {Stanford, Iowa State, Wyoming, Rochester} and also
for the sample of pH measurements {6.3, 6.2, 5.9, 6.5}. If two samples are
simultaneously under consideration, either m and n or n
1
and n
2
can be used to
denote the numbers of observations . Thus if {3.75, 2.60, 3.20, 3.79} and {2.75,
1.20, 2.45} are grade point averages for students on a mathematics floor and the rest
of the dorm, respectively, then m ¼ 4 and n ¼ 3.
1.2 Pictorial and Tabular Methods in Descriptive Statistics 9