Identify QIAseq DNA Ultra Somatic Variants

The Identify QIAseq DNA Ultra Somatic Variants template workflow supports analysis of Illumina QIAseq Ultra panel data. The ultra panels are designed to provide high coverage in targeted regions to allow identification of low frequency variants in cfDNA. As the read structure is different from standard QIAseq panels, the Identify QIAseq DNA Ultra Somatic Variants template workflow should only be used to process data from from QIAseq Ultra panels.

The Identify QIAseq DNA Ultra Somatic Variants workflow is set up to detect very low frequency variants. Please note that to call low frequency variants, coverage must be high. In low coverage samples or regions, very low frequency variants are unlikely to be represented in the reads.

The primers and target regions for the Ultra panels are available in the reference data set QIAseq DNA Ultra Panels hg38.

To run the workflow go to:

        Template Workflows | Biomedical Workflows (Image biomedical_twf_folder_open_16_n_p) | QIAseq Panel Analysis (Image qiaseqrna_folder_closed_16_n_p) | QIAseq DNA Workflows (Image qiaseq_workflows_folder_closed_16_n_p) | Identify QIAseq DNA Ultra Somatic Variants (Illumina) (Image qiaseqv3_molecolors1)

You can then select the reads to analyze (figure 13.42).

Image cfdna1
Figure 13.42: Choose the reads sequenced with an Ultra panel.

In the next dialog, check "Select a reference data set to use" and choose QIAseq DNA Ultra Panels hg38 (figure 13.43).

Image ultra2
Figure 13.43: Choose the appropriate Reference Data Set.

In the Target regions dialog, select the panel specific target regions track (figure 13.44). Variants will only be detected within target regions.

Image ultra3
Figure 13.44: Choose the relevant Target regions track.

Repeat the selection of the appropriate track for Target primers in the subsequent dialog.

In the Map Reads to Reference dialog, it is possible to configure masking. A custom masking track can be used, but by default, the masking track is set to GenomeReferenceConsortium_masking_hg38_no_alt_analysis_set, containing the regions defined by the Genome Reference Consortium, which serve primarily to remove false duplications, including one affecting the gene U2AF1. Changing the masking mode from "No masking" to "Exclude annotated" excludes these regions.

In the Create UMI Reads from Grouped Reads dialog, it is possible to specify settings for UMI grouping (figure 13.45).

Image ultra4
Figure 13.45: Configuring the settings for creation of UMI reads.

The QIAseq DNA Ultra data is expected to contain very large UMI groups and more PCR or sequencing errors may consequently be present in the UMIs compared to other sequencing protocols. Therefore, settings for grouping reads into UMI groups should be more relaxed than settings for standard panels. This is reflected in the default settings for Create UMI Reads from Grouped Reads in this workflow. See Create UMI Reads from Grouped Reads for details about UMI grouping using the tool Create UMI Reads from Grouped Reads.

In the QC for Target Sequencing dialog, specify the minimum coverage for QC (figure 13.46). Using default settings, samples where 90 percent of target region positions do not meet this threshold will be flagged in the Combined report generated by the workflow.

Image ultra5
Figure 13.46: Configuring the QC for Target Sequencing tool.

The Copy Number Variant Detection (CNVs) dialog allows you to specify a control mapping against which the coverage pattern in your sample will be compared in order to call CNVs (figure 13.47). If you do not specify a control mapping the Copy Number Variation analysis will not be carried out.

Image ultra6
Figure 13.47: Select control mappings for the CNV detection tool.

Please note that if you want the copy number variation analysis to be done, it is important that the control mapping supplied is a meaningful control for the sample being analyzed. Mapping of control samples for the CNV analysis can be done using the workflow described in Create QIAseq DNA Ultra CNV Control Mapping.

A meaningful control must satisfy two conditions: (1) It must have a copy number status that is meaningful to compare against. For panels with targets on the X and Y chromosomes, the control and sample should be matched for gender. (2) The control read mapping must result from the same type of processing that will be applied to the sample.

For interpretation of fold changes in low purity samples see http://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=How_interpret_fold_changes_when_sample_purity_is_not_100.html.

Next, set the variant quality filtering thresholds QUAL and Average quality (figure 13.48). Variants with quality below the specified thresholds will not be reported.

Image ultra7
Figure 13.48: Specify quality threshold for filtering variants.

In the following wizard steps, it is possible to specify additional variant filtering criteria. When each of these have been set, choose where to save the results and press Finish to start the analysis.

Launching using the QIAseq Panel Analysis Assistant

The workflow is also available in the QIAseq Panel Analysis Assistant under Targeted DNA Ultra.



Subsections