Identify DNA Germline Variants workflow

The Identify DNA Germline Variants workflow includes the minimum analysis steps necessary for variant detection when starting with a set of sequencing reads. This workflow also includes steps to generate reports and visualizations.

We recommend that samples with known variants are used to test and to optimize the workflow settings to fit your specific application. Suggestions for customizations of this workflow are provided at the end of this section.

The workflow requires reference data. Any available, relevant reference sequence can be chosen. Reference data for some organisms can be downloaded using the Reference Data Manager (References management).

Launching the workflow

The Identify DNA Germline Variants workflow is at:

        Workflows | Template Workflows | Basic Workflow Designs (Image basic_twf_folder_closed_16_n_p) | Identify DNA Germline Variants (Image germline_variant_calling_twf_16_n_p)

Key steps when launching the workflow include:

  1. Selecting the sequence lists containing the reads to analyze.
  2. Selecting relevant reference data (figure 14.106).
  3. Configuring trimming options.
  4. Optionally, restricting the InDels and Structural Variants tool to call variants only in target regions (figure 14.107).
  5. Optionally, restricting Fixed Ploidy Variant Detection to call variants only in target regions.
  6. Specifying thresholds for the variants to be reported (figure 14.108).
  7. Selecting a location to save outputs to.

Tools in the workflow and outputs generated

The tools and outputs provided by this workflow are:

Image dna_germline_wf_select_reference_data
Figure 14.106: In the "Specify reference data handling" wizard step, you can select a Reference Data Set with the relevant reference sequence, as shown here, or choose the "Use specified data elements" option and then choose a particular reference sequence in the following wizard step.

Image dna_germline_wf_select_target_region
Figure 14.107: If a target region track is specified in this wizard step, the InDels and Structural Variants tool will only call variants within those target regions.

Image dna_germline_wf_set_filtering_criteria
Figure 14.108: Minimum frequency and quality values can be configured to remove marginal variants from the results.

Customizing the Identify DNA Germline Variants workflow

Template workflows can be easily edited to add or remove analysis steps, change parameter settings, and so on. See Template workflows for information about how to open a template workflow for editing.

Some changes that may be of particular interest when working with the Identify DNA Germline Variants workflow are: