Extracting a subset of NCBI's bacterial genomes database
The NCBI collection of bacterial genomes generally includes multiple representatives of each genus. To extract a genus specific subset of sequences to a new list:
- Open the downloaded bacterial genomes database.
- Switch to tabular element mode (
).
- Filter towards the desired genus (e.g. Salmonella sequences as in figure 10.5).
- Select all rows.
- Click the Create New Sequence List button.
- Save the subset reference list.
Figure 10.5: The downloaded NCBI bacterial genomes database was filtered for Salmonella data. A subset of 44 out of 2,253 sequences matched this search criterion.