Our first step is to construct a database for Primrose to use.
Our database should contain representatives of the target taxon we wish
to design a probe for, as well as members of bacterial taxa we want to
exclude from the probe's range. Ideally our database should be as
comprehensive as possible without being so big as to impact adversely on
the program's performance. In this example, for illustrative
purposes, we use sequence data from a range of sources.
We click 'Construct
Database' button and the following dialog box appears:
We next select 'Load Sequence File(s)' and use the resulting Open File
Dialog box to load the sequence files which will make up our final
database. Our target taxon is represented by a Fasta formatted
collection of 16S rRNA partial sequences, and we load this first:
From a previous analysis we already know our taxon appears to be a
novel group within the Bacteroidetes
phylum, so next we visit the Ribosomal Database Project and download the
sequences for all Bacteroidetes isolates greater than or equal to 1200
nucleotides in length from their preview site (http://rdp.cme.msu.edu/html/analyses_preview.html).
We select to download the data in GenBank format with common gaps
removed (to retain their alignment but exclude superfluous gap
characters). The resulting 1304 sequence file is incorporated into
our database:
Finally we decide to include small-subunit ribosomal RNA prokaryote
data from the Ribosomal Database project Release 8.1 database (included
with Primrose within the 'databases'
directory). This database is useful here because (i) it
includes a phylogenetic tree which can be used by Primrose, and (ii) it
covers a wide range of prokaryote taxa including Archaeal groups.
The prokaryotes data is loaded by selecting the SSU_Prok.phylo file
within the 'databases' directory:
The final database appears as follows:
Our database complete, we now press the 'Accept' button and return to
the main window.