Working with annotations
Annotations provide information about specific regions of a sequence.
A typical example is the annotation of a gene on a genomic DNA sequence.
Annotations derive from different sources:
- Sequences downloaded from databases like GenBank are annotated.
- In some of the data formats that can be imported into CLC Genomics Workbench, sequences can have annotations (GenBank, EMBL and Swiss-Prot format).
- The result of a number of analyses in CLC Genomics Workbench are annotations on the sequence (e.g. finding open reading frames and restriction map analysis).
- A protein structure can be linked with a sequence (Link sequence or sequence alignment to structure), and atom groups defined on the structure transferred to sequence annotations or vica versa (Transfer annotations between sequence and structure).
- You can manually add annotations to a sequence (described in the Adding annotations).
If you would like to extract parts of a sequence (or several sequences) based on its annotations, you can find a description of how to do this in Extract Annotations.
Note! Annotations are included if you export the sequence in GenBank, Swiss-Prot, EMBL or CLC format. When exporting in other formats, annotations are not preserved in the exported file.
Subsections
