Annotate CDS with Best DIAMOND Hit

Annotate CDS with Best DIAMOND Hit allows you to annotate a set of contigs containing CDS annotations with their best DIAMOND hit. This tool is particularly useful for large data sets, as an alternative to Annotate CDS with Best BLAST Hit.

DIAMOND is a sequence aligner for protein and translated DNA searches, designed for high performance analysis of big sequence data, see https://github.com/bbuchfink/diamond. The key features are:

To start the analysis, go to:

        Toolbox | Microbial Genomics Module (Image mgm_folder_closed_flat_16_h_p) | Functional Analysis (Image functional_analysis_folder_closed_16_n_p) | Annotate CDS with Best DIAMOND Hit (Image annotate_with_best_diamond_16_n_p)

Select the CDS-annotated contigs to be annotated with DIAMOND hits.

Image diamondparameters
Figure 11.9: Annotate CDS with Best DIAMOND Hit parameters.

In the Parameters dialog page (figure 11.9), set the following

The tool will output a copy of the input file with the DIAMOND Hit annotations. The tool can also output an annotation table summarizing information about the annotations added to the sequence list. Finally it is possible to generate a report containing information about the input file, the DIAMOND database and the amount of CDS annotated with a DIAMOND hit.

If a DIAMOND index was created from a protein sequence list containing metadata (such as GO terms or taxonomy information), the original metadata will be transferred to the annotations created by this tool.