Using Mfold
Mfold is an implementation of the Zuker algorithm that makes it possible to
predict the energetically optimal secondary structure of an RNA molecule. It
uses a sophisticated model that can take into account many realistic physical
parameters that affect the RNA folding — such as pH, temperature, and the
local composition bias of your RNA.
355
Chapter 12: Working with RNA
RNA secondary structures
A single-stranded RNA molecule works exactly
like sticky tape. It will not be stable unless some
of its exposed bases are protected from water.
A good way to protect an RNA base from the
solvent is to pair it with another RNA base. Of
course, all possible base-pair combinations are
not equally good. Pairing a guanine with a cyto-
sine (for example) is more stabilizing than pair-
ing an adenine with a uracile.
The pairing of these bases forms the
RNA sec-
ondary structure.
When a molecule contains
two long stretches that are complementary,
they yield a nice stable stem. (Refer to Figure
12-1.) The unpaired bases between the stem
strands make up a loop. Stems don’t have to
be perfect; they can also contain unpaired
residues (which RNA gurus name
bulges
)
.
We assume that the natural tendency of the
RNA molecule is to reach its most stable con-
formation by assembling a nice collection of
pairwise interactions, giving the molecule the
highest stability it can have. This concept is
what we call the
lowest-energy model.
Such a
stable RNA structure always has a negative
energetic value (such as –70 Kcal/mol); if you
want to unfold it, you need to provide some
energy (heat).
As is true for proteins, we don’t know exactly
how the RNA molecule finds this lowest energy
form — but we know this happens very rapidly
in the cell. The stability of an RNA fold doesn’t
completely depend on the number of GC pairs it
contains. Many parameters — such as the
stacking of base pairs (making some stems
much more stable than others) and loop sizes —
influence the fold. Sophisticated algorithms (like
that found in the mfold program) can take these
subtle effects into account when computing an
optimal fold.
Tertiary interactions also play a role in overall
stability. These include
pseudo-knots
(again,
refer to Figure 12-1) that are usually long-range
interactions between a loop and another
portion of the RNA molecule. The interaction
between the RNA molecule and other chemical
elements — such as ions, proteins, or other
RNAs — also plays an important role in its
stabilization.
Unfortunately, we have difficulties predicting
tertiary interactions or the effect of the proteins
on the folding of an RNA molecule. So, when
you predict the secondary structure of your
RNA, this prediction usually depends on the
assumption that this RNA folds on its own in the
cell. Because this is almost never true, there is
always a chance for your prediction to be partly
or totally incorrect. The general rule is that the
most energetically stable features tend to be
reasonably close to the truth.