each other’s closest relatives, with C as the outgroup, despite the fact that
B and C are actually the closest relatives in the true species phylogeny. The
gene tree—the phylogeny established from that particular gene—is not
wrong. It accurately reflects the history of that gene. The problem is that the
history of the particular gene may not reflect the history of the genome as a
whole, and thus the gene tree can lead us astray from determining the true
phylogenetic relationships of the species (the species tree).
The problem that these polymorphisms existing in the ancestral species
(and hence, called ancestral polymorphisms) can cause usually presents itself
when the distance between nodes and is relatively small.
6
When the
difference between the two nodes is large, there is enough time for one or
the other genetic variant to become fixed in the ancestor of species B and C,
and thus both B and C (the two closest related species) will have the same
variant. This time is usually on the order of several times the effective popu-
lation size in generations. If the effective population size is ,, then after
a , generations or so, the problem is mitigated; , generations in
an ape, however, is on the order of a million years. (If the effective popula-
tion sizes of humans and the nonhuman apes were greater, then this problem
would be worse because more variants would have been maintained during
the time between nodes and .)
So how do population geneticists cope with this pesky problem raised by
the assortment of ancestral polymorphisms? The usual approach is to collect
more data—that is, more DNA sequences from as many genes as possible—
and then determine whether a consensus emerges among the genes. With the
increasing speed and decreasing cost of sequencing, this brute force method
has become progressively easier. In well-studied groups such as the great
apes, such power became available by the early and middle s. As the
technology progresses, sequencing multiple genes will become par for the
course even in groups of species from organisms that are not as well studied.
Returning to the human-chimp-gorilla story, the preponderance of genes
examined does support the close relationship between humans and chimps.
During the middle s, Maryellen Ruvolo examined the data from a
number of genes.
7
She first lumped genes that were very closely genetically
linked into a single data set, as linked genes may not be independent because
they tend to share the same evolutionary history and hence genealogy. As an
extreme example, all of the mitochondrial genes count as one independent
gene because the lack of recombination ties their fates together. She also
threw out those genes that failed to conclusively support any phylogenetic
hypothesis. After this culling of the data, Ruvolo was left with independent
genes that each provided conclusive phylogenies. Eleven of these genes support
a tree with humans and chimpanzees as the closest relatives, two support a tree
that places gorillas and chimps together, and just one gene supports a tree that
has humans and gorillas as closest relatives. How strong is the support for
humans and chimpanzees being closest relatives? Very strong; if humans
and chimpanzees were not each other’s closest relatives, the likelihood of
DARWINIAN DETECTIVES