Create Consensus Sequences from Variants

Using the Create Consensus Sequences from Variants tool, consensus sequences can be created from a Variant track (Image variant_track_16_n_p) and the matching reference genome (Image sequence_dna).

It is possible to mask out low coverage regions (Image annotation_track_16_n_p) with N's when a coverage track is provided. To create a low coverage region track use the Create Mapping Graph (Image graph_track_16_n_p) tool, see Create Mapping Graph, to identify coverage in your sample followed by filtering on specified frequency using the Identify Graph Threshold Areas (Image resequencing) tool, see Identify Graph Threshold Areas.

A number of variant filtering options defining criteria for inclusion in the consensus sequence are available:

For variants remaining after filtering, the following rules apply for sites where variants overlap:

Running the Create Consensus Sequences from Variants tool

To run the Create Consensus Sequences from Variants tool, go to:

        Tools | Resequencing Analysis (Image resequencing) | Create Consensus Sequences from Variants (Image consensus_from_variants_16_h_p)

In the first step, select the variant track that contains the variants from which the consensus should be created.

The next step provides options for how to construct the consensus sequences (see figure 32.35).

Image consensus_wizard
Figure 32.35: Options available to configure when running the Create Consensus Sequences from Variants tool.

In the Tracks parameters, specify the reference genome and optionally provide an annotation track containing low coverage regions.

Variant handling parameters include:

In the final step, specify the output type and a save data location.

The tool offers two types of consensus sequence format: