Shuffle sequence
In some cases, it is beneficial to shuffle a sequence. This is an option in the Toolbox menu under General Sequence Analyses. It is normally used for statistical analyses, e.g. when comparing an alignment score with the distribution of scores of shuffled sequences.
Shuffling a sequence removes all annotations that relate to the residues.
select sequence | Toolbox in the Menu Bar | Classical Sequence Analysis () | General Sequence Analysis ()| Shuffle Sequence ()
or right-click a sequence | Toolbox | Classical Sequence Analysis () | General Sequence Analysis ()| Shuffle Sequence ()
This opens the dialog displayed in figure 14.3:
Figure 14.3: Choosing sequence for shuffling.
If a sequence was selected before choosing the Toolbox action, this sequence is now listed in the Selected Elements window of the dialog. Use the arrows to add or remove sequences or sequence lists from the selected elements.
Click Next to determine how the shuffling should be performed.
In this step, shown in figure 14.4:
Figure 14.4: Parameters for shuffling.
For nucleotides, the following parameters can be set:
- Mononucleotide shuffling. Shuffle method generating a sequence of the exact same mononucleotide frequency
- Dinucleotide shuffling. Shuffle method generating a sequence of the exact same dinucleotide frequency
- Mononucleotide sampling from zero order Markov chain. Resampling method generating a sequence of the same expected mononucleotide frequency.
- Dinucleotide sampling from first order Markov chain. Resampling method generating a sequence of the same expected dinucleotide frequency.
- Single amino acid shuffling. Shuffle method generating a sequence of the exact same amino acid frequency.
- Single amino acid sampling from zero order Markov chain. Resampling method generating a sequence of the same expected single amino acid frequency.
- Dipeptide shuffling. Shuffle method generating a sequence of the exact same dipeptide frequency.
- Dipeptide sampling from first order Markov chain. Resampling method generating a sequence of the same expected dipeptide frequency.
Click Next if you wish to adjust how to handle the results. If not, click Finish.
This will open a new view in the View Area displaying the shuffled sequence. The new sequence is not saved automatically. To save the sequence, drag it into the Navigation Area or press ctrl + S ( + S on Mac) to activate a save dialog.