Bioinformatics toolkit
www.cardiff.ac.uk/biosi/research/biosoft/

Mallard: Input File


When constructing an input file for the Mallard program ensure that:

Note...

When constructing an input file, also consider the number of sequences and the phylogenetic diversity of the bacteria they represent. Bear in mind that the program detects anomalies through a series of comparative analyses among the sequences included in the file. Avoid sequences that are too phylogenetically dissimilar, and include as many reliable sequences as possible, but no more than 1,000, as a larger file will slow the program too much (program duration increases exponentially with sequence number).

As a rough guide, a good input file contains a reasonable number of sequences (say, between 50 and 500) that vary in overall phylogenetic difference by around 20% or less.
An example input file can be found here.
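
The rough guide above can be checked before running Mallard. The following Python sketch is not part of Mallard. It assumes that the input is an aligned FASTA file and that "phylogenetic difference" can be approximated by the proportion of differing positions between each pair of sequences, ignoring gapped columns. The function names and thresholds (parse_fasta, p_distance, check_input, min_n, max_n, hard_max, max_diff) are hypothetical and simply mirror the figures quoted on this page.

```python
from itertools import combinations


def parse_fasta(text):
    """Return {name: sequence} from FASTA-formatted text (assumed format)."""
    records, name = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            records[name] = []
        elif name is not None:
            records[name].append(line.upper())
    return {n: "".join(parts) for n, parts in records.items()}


def p_distance(a, b):
    """Fraction of differing positions, skipping columns gapped in either sequence."""
    compared = differing = 0
    for x, y in zip(a, b):
        if x in "-." or y in "-.":
            continue
        compared += 1
        differing += x != y
    return differing / compared if compared else 0.0


def check_input(seqs, min_n=50, max_n=500, hard_max=1000, max_diff=0.20):
    """Return a list of warnings based on the rough guide (thresholds are assumptions)."""
    warnings = []
    n = len(seqs)
    if n > hard_max:
        warnings.append(f"{n} sequences: more than {hard_max} will be very slow")
    elif not min_n <= n <= max_n:
        warnings.append(f"{n} sequences: outside the suggested {min_n}-{max_n}")
    # Every pair is compared, so this check itself slows down for large files.
    worst = max(
        (p_distance(a, b) for a, b in combinations(seqs.values(), 2)),
        default=0.0,
    )
    if worst > max_diff:
        warnings.append(f"largest pairwise difference {worst:.1%} exceeds {max_diff:.0%}")
    return warnings


if __name__ == "__main__":
    demo = ">a\nACGT\nACGT\n>b\nACGTACGA\n>c\nAC-TACGT\n"
    for message in check_input(parse_fasta(demo)):
        print(message)
```

A file that produces no warnings is not guaranteed to be suitable; the check only reflects the sequence count and the approximate diversity suggested above.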



Dr K.E. Ashelford. © 2006, Cardiff University