A multiple alignment is probably the most useful investigative tool you can
generate in bioinformatics. In many ways, multiple sequence alignments are
to bioinformatics what Swiss knives are to MacGyver; we’re hard-pressed to
think of a situation where they don’t help. If you aren’t sure whether a multi-
ple alignment can help your work — or if you don’t see exactly how you can
fit one of these into your experiments — we suggest that you start reading
the first section of this chapter, “Finding Out if a Multiple Sequence
Alignment Can Help You.”
In any case, remember that multiple alignments are useful for predicting
protein structures (see Chapter 11), central for predicting the function of
proteins, and indispensable for phylogenetic analysis (see Chapter 13). Of
course, the better your multiple sequence alignment, the better use you can
make of it: better structural models, better functional predictions, and better
phylogenetic trees.
Even so, building multiple sequence alignments is far from an exact science.
In fact, it’s more art than science, requiring that you use everything you know
in bioinformatics and in biology. This chapter is here to give you the secrets
of the trade so that you, too, will be able to create the multiple sequence
alignment that best suits your needs — and even find your own recipes.
If you already have an alignment and you only want to modify it or improve
its appearance for publication (highlighting, shading, and so on), go directly
to Chapter 10 — that’s the chapter entirely dedicated to such fine-tuning
techniques.
Finding Out if a Multiple Sequence
Alignment Can Help You
Figure 9-1 shows you what a multiple alignment actually looks like: As you
can see, it’s really only a matter of rewriting your sequences so similar fea-
tures end up in the same columns. That’s it, plain and simple.
This kind of multiple alignment is ideal if you want to study a sequence family
where all sequences share the
same common ancestor. Don’t worry if you
have only one member at this stage of the game; we show you later in this
chapter how to organize a family gathering!
266
Part III: Becoming a Pro in Sequence Analysis