
|
Note...
* To relate the probability distribution Q to any other sequence alignment (e.g. while running the Pintail algorithm), a reference sequence must be included to serve as a position guide. Q probabilities are calculated with reference to this sequence, so the size of Q corresponds to the length of the reference. The implementation of the Pintail algorithm used by the Pintail and Mallard programs uses a Q distribution generated with Escherichia coli K12 U00096 as reference.
§ At any particular position i, the value of f_i ranges from 0.25 to 1, inclusive. f_i can never be lower than 0.25 because the lowest possible frequency of the most common base occurs when all four bases (A, C, G, and T/U) are equally common. Consequently, to convert frequencies to probabilities (which should range from 0 to 1, inclusive), each frequency is rescaled with the equation p_i = (f_i - 0.25)/0.75. |
|
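The rescaling described in the § note can be sketched in Python as follows. This is only an illustration, not the code used by Pintail or Mallard: the function names are hypothetical, and the handling of gaps and ambiguity codes (they are simply ignored when counting bases) is an assumption that the note does not specify.

```python
from collections import Counter

BASES = "ACGT"


def most_common_base_frequency(column):
    """Return f_i, the frequency of the most common base in one alignment column.

    Assumption of this sketch: U is counted as T, and any character other
    than A, C, G, or T/U (gaps, ambiguity codes) is ignored.
    """
    normalized = column.upper().replace("U", "T")
    counts = Counter(base for base in normalized if base in BASES)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("column contains no A, C, G, or T/U")
    return max(counts.values()) / total


def frequency_to_probability(f_i):
    """Rescale f_i from the range [0.25, 1] to p_i in the range [0, 1]."""
    if not 0.25 <= f_i <= 1.0:
        raise ValueError("f_i must lie between 0.25 and 1, inclusive")
    return (f_i - 0.25) / 0.75


if __name__ == "__main__":
    # Hypothetical alignment columns, one string per position.
    for column in ["ACGT", "AAAC", "GGGG"]:
        f_i = most_common_base_frequency(column)
        print(column, f_i, frequency_to_probability(f_i))
```

A column in which all four bases are equally common gives f_i = 0.25 and therefore p_i = 0, while a fully conserved column gives f_i = 1 and p_i = 1.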