Computing evolutionary history using the sequences reversed
and
reversed is computationally the same problem as for
and
as given.
Let be the probability for the sequences
and
to align by
some evolutionary history. Let
be
normalized
by the probability to observe the given sequences.
The base cases are
The product
is the
probability
and
arose from some evolutionary history
which restricts to an evolutionary history for
and
, and
for
and
Let
Every entry in is comparable to every other entry. The
contours near the maximum value of
bound the region containing
the paths of the most likely alignments.
We propose another pair of arrays and
which
weight recent history more than distant history. We adapt the
standard technique of approximating a sliding window by multiplying
an accumulator by a factor between zero and one, and adding the new
data value. An alternative interpretation is that we view the
developing evolutionary history through a fog which makes distant
states uncertain. The thickness of the historical fog is a parameter
The generic computation for
is
Another model with the same mathematical description is
that with probability
the subsequences
and
are observed
and discarded from consideration in the evolutionary history.
The Smith-Waterman algorithm is a Viterbi path application
of the similar idea that the alignment process might have
optimal score applied to subsequences of
and
.
We use the parameter
When , and the other parameters are equal,
We implement the computations
We also compute the equilibriuim value for
noise given parameters
and distribution
.
This is the limit for large
of
for
long random sequences constructed from distribution
Using , we can approximate
in any shape
region by clamping the boundary values of
and
equal to
when they are not otherwise
actually calculated. This makes sense, for example, when
we want to avoid computing on an area where we believe there
is no alignment and want results for an adjacent area.
We can also mask out a high scoring evolutionary history
by forcing
on its path. This allows a second
place ridge to be seen as maximum.