
62 Chapter 2
How to find the median in three steps:
1. Line your numbers up in order, from smallest to largest.
2. If you have an odd number of values, the median is the one
in the middle. If you have
n
numbers, the middle number is
at position (
n
+ 1) / 2.
3. If you have an even number of values, get the median by
adding the two middle ones together and dividing by 2.
You can find the midpoint by calculating (
n
+ 1) / 2. The
two middle numbers are on either side of this point.
Q:
Is it still OK to use the mean with skewed data if I really
want to?
A: You can, and people often do. However, in this situation the
mean won’t give you the best representation of what a typical value
is. You need the median.
Q:
You say that, but surely the whole point of the mean is
that it gives a typical value. It’s the average.
A: The big danger is that the mean will give a value that doesn’t
exist in the data set. Take the Kung Fu class as an example. If you
were to go into the class and pick a person at random, the chances
are that person would be around 20 years old because most people
in the class are that sort of age. Just going with the mean doesn’t
give you that impression. Finding the median can give you a more
accurate perspective on the data.
But sometimes even the median will give a value that’s not in the
data set, like our example on the previous page. That’s precisely why
there’s more than one sort of average; sometimes you need to use
different methods in order to accurately say what a typical value is.
Q:
So is the median better than the mean?
A: Sometimes the median is more appropriate than the mean,
but that doesn’t make it better. Most of the time you’ll need to use
the mean because it usually offers significant advantages over the
median. The mean is more stable when you are sampling data. We’ll
come back to this later in the book.
Q:
How do I use the mean or median with categorical data?
What about examples like the data on page 9 of Chapter 1?
A: You can only find the mean and median of numerical data.
Don’t worry, though, there’s another sort of average that deals with
just this problem that we’ll explore later on.
Q:
I always get right- and left-skewed data mixed up. How do
I remember which is which?
A: Skewed data has a “tail” of outliers. To see which direction
the data is skewed in, find the direction the tail is pointing in. For
example, right-skewed data has a tail that points to the right.
calculating the median: step-by-step