Working with annotations
Annotations provide information about specific regions of a sequence. A typical example is the annotation of a gene on a genomic DNA sequence.
Annotations derive from different sources:
- Sequences downloaded from databases like GenBank are annotated.
- Some of the data formats that can be imported into CLC Genomics Workbench (GenBank, EMBL and Swiss-Prot) allow sequences to carry annotations.
- The results of a number of analyses in CLC Genomics Workbench are annotations on the sequence (e.g. finding open reading frames and restriction map analysis).
- A protein structure can be linked with a sequence (Link sequence or sequence alignment to structure), and atom groups defined on the structure can be transferred to sequence annotations or vice versa (Transfer annotations between sequence and structure).
- You can manually add annotations to a sequence (described in Adding annotations).
If you would like to extract parts of a sequence (or several sequences) based on its annotations, you can find a description of how to do this in Extract Annotations.
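To make the idea of extracting by annotation concrete, here is a minimal Python sketch. It is not how CLC Genomics Workbench implements the feature, and it does not use any CLC API. The `Annotation` class, the `extract_regions` function, the 1-based inclusive coordinates and the example sequence are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """Hypothetical annotation: a named, typed region of a sequence."""
    name: str
    kind: str   # e.g. "gene" or "restriction site"
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def extract_regions(sequence, annotations, kind):
    """Return {annotation name: subsequence} for annotations of one kind."""
    return {
        a.name: sequence[a.start - 1 : a.end]  # convert to 0-based slicing
        for a in annotations
        if a.kind == kind
    }


# Made-up example data, not taken from any real record.
sequence = "ATGGCCTAAGGATGAAATGA"
annotations = [
    Annotation("geneA", "gene", 1, 9),
    Annotation("geneB", "gene", 12, 20),
    Annotation("site1", "restriction site", 9, 12),
]

genes = extract_regions(sequence, annotations, "gene")
```

Here `genes` holds one entry per "gene" annotation, each mapped to the part of the sequence that the annotation covers. The "restriction site" annotation is skipped because its type does not match.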
Note! Annotations are included if you export the sequence in GenBank, Swiss-Prot, EMBL or CLC format. When exporting in other formats, annotations are not preserved in the exported file.