We consider further implications of the reversibility
of the ground process. The parameters and
are
a priori two degrees of freedom in the ground process.
If the lengths of
and
are informative, setting
and
to maximize the
probability to observe sequences of the given lengths
gives a relationship between
and
. We choose
and
derive
.
If the sequences
and
are subsequences of very very long genomes, the
lengths of
and
may be artifacts of truncation.
We express this in the ground process by
, making
insertions and deletions equally likely. In this case,
extension of
is a zero information event,
expressed by
. Other transition probabilities are computed
in terms of
Gaps at the ends of alignments could be artifacts
of truncation. We model this by replacing
by
at the edges of the
array.
This models extending the sequences in each direction
infinitely with bases selected from distribution
We normalize by dividing by the probability to observe
sequences and
separately given
and
and omitting the factor for the inital state
. We report separately the log likelihood to observe
the given sequences,
Assembling the above insights, we compute
The array provides the same kind of information as the
.
The essential difference is that the gap cost for leading and
trailing gaps is canceled. It is possible to treat gaps
differently along each edge of the array.