Annotate CDS with Best DIAMOND Hit
The Annotate CDS with Best DIAMOND Hit tool will allow you to annotate a set of contigs containing CDS annotations with their best DIAMOND hit. This tool is particularly useful for large data sets, as an alternative to Annotate CDS with Best BLAST Hit.
DIAMOND is a sequence aligner for protein and translated DNA searches, designed for high performance analysis of big sequence data. The key features are:
- Pairwise alignment of proteins and translated DNA at 500x-20,000x speed of BLAST.
- Frameshift alignments for long read analysis.
- Low resource requirements and suitable for running on standard desktops or laptops.
The version of the DIAMOND binaries bundled with the tool is v0.9.21. For questions or comments about DIAMOND, see https://github.com/bbuchfink/diamond.
To start the tool, go to:
Metagenomics () | Functional Analysis (
) | Annotate CDS with Best DIAMOND Hit (
)
Several parameters are available:
- Genetic code parameters. The genetic code used for translating CDS to proteins.
- DIAMOND parameters.
- Expectation value. The minimum expectation value (E) threshold to use.
- Protein Sequence Database. Popular public protein sequence databases can be downloaded using the Download Protein Database tool (see section Download Protein Database).
The tool will output a copy of the input file with the DIAMOND Hit annotations. The tool can also output an annotation table summarizing information about the annotations added to the sequence list. Finally it is possible to generate a report containing information about the input file, the DIAMOND database and the amount of CDS annotated with a DIAMOND hit.