Create Motif List from Sequences

Create Motif List from Sequences takes as input sequences (Image sequence_dna) (Image sequence_rna) (Image sequence_protein2), sequence lists (Image seq_list_nucleotide) (Image seq_list_protein), or alignments (Image alignment), and outputs a motif list (Image motiflist_16_n_p) containing simple motifs created from the provided sequences or their annotations.

To run the tool, go to:

        Tools | General Sequence Analysis (Image generalsequenceanalyses)| Motif (Image motif_folder_closed_16_n_p) | Create Motif List from Sequences (Image new_motiflist_16_n_p)

The following options can be configured (figure 18.24):

Image createmotifsfromsequences
Figure 18.24: Creating a new motif list from sequences.

A simple motif is created for each region defined by the selected options, using the sequence residues within that region (figure 18.25). Each created motif contains the following information:

Image createmotifsfromsequences_output
Figure 18.25: Motif list created from a sequence containing multiple types of annotations. Top: Sequence view. Middle: Sequence annotation table view. Bottom: Motif list view.

Duplicate motif collapsing

If several motifs have the same annotation type and motif sequence, they are collapsed into a single motif (figure 18.25). For the combined motif, the "Description" and each type of annotation note, including "From sequence" and "From annotation", retain up to five unique notes from the original duplicated motifs. The "Name" is set to that of the first duplicated motif.

Special handling of complex annotations

Annotations with a region that is not simple are processed as follows: