Annotate CDS with Best DIAMOND Hit

The Annotate CDS with Best DIAMOND Hit tool will allow you to annotate a set of contigs containing CDS annotations with their best DIAMOND hit. This tool is particularly useful for large data sets, as an alternative to Annotate CDS with Best BLAST Hit.

DIAMOND is a sequence aligner for protein and translated DNA searches, designed for high performance analysis of big sequence data. The key features are:

The version of the DIAMOND binaries bundled with the tool is v0.9.26.127. For questions or comments about DIAMOND, see https://github.com/bbuchfink/diamond.

To start the tool, go to:

        Toolbox | Microbial Genomics Module (Image mgm_folder_closed_flat_16_h_p) | Functional Analysis (Image functional_analysis_folder_closed_16_n_p) | Annotate CDS with Best DIAMOND Hit (Image annotate_with_best_diamond_16_n_p)

Several parameters are available (figure 13.9):

Image diamondparameters
Figure 13.9: BLAST Best Hit annotations added to gene cds4 of h. pylori.

The tool will output a copy of the input file with the DIAMOND Hit annotations. The tool can also output an annotation table summarizing information about the annotations added to the sequence list. Finally it is possible to generate a report containing information about the input file, the DIAMOND database and the amount of CDS annotated with a DIAMOND hit.

If a DIAMOND index was created from a protein sequence list containing metadata (such as GO terms or taxonomy information), the original metadata will be transferred to the annotations created by this tool.