
RNA Secondary Structure Boltzmann Distribution R 779
model or that the real structure indeed does fluctuate be-
tween several low energy alternatives. Moments of Boltz-
mann distributions are used in identifying how biological
RNA molecules deviates from random RNA sequences.
The data structures computed in Theorem 1 can also
be used to efficiently sample secondary structures from the
Boltzmann distribution. This has been used for probabilis-
tic methods for secondary structure prediction, where the
centroid of the most likely cluster of sampled structures is
returned rather than the most likely, i. e. minimum free en-
ergy, structure [3]. This approach better accounts for the
entropic effects of large neighborhoods of structurally and
energetically very similar structures. As a simple illustra-
tion of this effect, consider twice flipping a coin with prob-
ability p > 0:5 for heads. The probability p
2
of heads in
both flips is larger than the probability p(1 p)ofheads
followed by tails or tails followed by heads (which again is
larger than the probability (1 p)
2
of tails in both flips).
However, if the order of the flips is ignored the probabil-
ity of one heads and one tails is 2p(1 p). The probability
of two heads remains p
2
which is smaller than 2p(1 p)
when p <
2
3
. Similarly a large set of structures with fairly
low free energy may be more likely, when viewed as a set,
than a small set of structures with very low free energy.
Open Problems
As for RNA Secondary Structure Prediction by Minimum
Free Energy, improvements in time and space complexity
are always relevant. This may be more difficult for comput-
ing distributions, as the more efficient dynamic program-
ming techniques of [9]cannotbeapplied.Inthecontext
of genome scans, the fact that the start and end positions
of encoded RNA molecule is unknown has recently been
considered [1].
Also the problem of including structures with pseu-
doknots, i. e. structures violating the last requirement in
Def. 1, in the configuration space is an active area of re-
search. It can be expected that all the methods of Theo-
rems3through6intheentryonRNASecondaryStruc-
ture Prediction Including Pseudoknots can be modified to
computation of distributions without affecting complexi-
ties. This may require some further bookkeeping to ensure
non-redundancy of recursions, and only in [4] has this ac-
tively been considered.
Though the moments of functions that are defined as
sums over independent loop contributions can be com-
puted efficiently, it is unknown whether the same holds for
functions with more complex definitions. One such func-
tion that has traditionally been used for statistics on RNA
secondary structure [12]istheorder of a secondary struc-
ture which refers to the nesting depth of multibranched
loops.
URL to Code
Software for partition function computation and a range
of related problems is available from www.bioinfo.rpi.edu/
applications/hybrid/download.php and www.tbi.univie.
ac.at/~ivo/RNA/. Software including a restricted class of
structures with pseudoknots [4] is available at www.
nupack.org.
Cross References
RNA Secondary Structure Prediction Including
Pseudoknots
RNA Secondary Structure Prediction by Minimum
Free Energy
Recommended Reading
1. Bernhart, S., Hofacker, I.L., Stadler, P.: Local RNA base pairing
probabilities in large sequences. Bioinformatics 22, 614–615
(2006)
2. Bernhart,S.H.,Tafer,H.,Mückstein,U.,Flamm,C.,Stadler,P.F.,
Hofacker, I.L.: Partition function and base pairing probabilities
of RNA heterodimers. Algorithms Mol. Biol. 1, 3 (2006)
3. Ding, Y., Chan, C.Y., Lawrence, C.E.: RNA secondary structure
prediction by centroids in a Boltzmann weighted ensemble.
RNA 11, 1157–1166 (2005)
4. Dirks, R.M., Pierce, N.A.: A partition function algorithm for nu-
cleic acid secondary structure including pseudoknots. J. Com-
put. Chem. 24, 1664–1677 (2003)
5. Hofacker, I.L., Stadler, P.F.: Memory efficient folding algorithms
for circular RNA secondary structures. Bioinformatics 22,1172–
1176 (2006)
6. Lyngsø, R.B., Zuker, M., Pedersen, C.N.S.: Fast evaluation of in-
ternal loops in RNA secondary structure prediction. Bioinfor-
matics 15, 440–445 (1999)
7. McCaskill, J.S.: The equilibrium partition function and base pair
binding probabilities for RNA secondary structure. Biopoly-
mers 29, 1105–1119 (1990)
8. Miklós, I., Meyer, I.M., Nagy, B.: Moments of the Boltzmann dis-
tribution for RNA secondary structures. Bull. Math. Biol. 67,
1031–1047 (2005)
9. Ogurtsov, A.Y., Shabalina, S.A., Kondrashov, A.S., Roytberg,
M.A.: Analysis of internal loops within the RNA secondary struc-
ture in almost quadratic time. Bioinformatics 22, 1317–1324
(2006)
10. Tinoco, I., Borer, P.N., Dengler, B., Levine, M.D., Uhlenbeck, O.C.,
Crothers, D.M., Gralla, J.: Improved estimation of secondary
structure in ribonucleic acids. Nature New Biol. 246, 40–41
(1973)
11. Tinoco, I., Uhlenbeck, O.C., Levine, M.D.: Estimation of sec-
ondary structure in ribonucleic acids. Nature 230, 362–367
(1971)
12. Waterman, M.S.: Secondary structure of single-stranded nu-
cleic acids. Adv. Math. Suppl. Stud. 1, 167–212 (1978)