Identify QIAseq Exome Causal Inherited Variants in Trio

The Identify QIAseq Exome Causal Inherited Variants in Trio template workflow is designed to support the analysis of data generated with QIAseq Human Exome kits. The workflow identifies putative disease causing, inherited variants in a family of three, where there is an affected parent, unaffected parent and a proband.

The first steps of the workflow involve trimming off any remaining PCR adapters. This is followed by mapping the trimmed reads to the human reference sequence. The Structural Variant Caller then generates a guidance track that is used in the Local Realignment tool to improve the mapping. The improved mapping is then input to the Fixed Ploidy Variant Detection tool. The resulting variants are filtered to remove those located outside defined target regions. Remaining variants are then annotated with information such as the relation to repeat/homopolymer regions or gene elements. Finally, a series of filtering steps removes variants likely to be artifacts.

The putative disease causing variants are identified by creating a list of variants present in both affected individuals and subtracting all variants in the unaffected individual. Additional checks are then carried out, allowing variants to be classified, for example as de novo variants, recessive variants, etc. The variants are output, along with reports and other associated results.

The Identify QIAseq Exome Causal Inherited Variants in Trio template workflow can be found at:

        Template Workflows | Biomedical Workflows (Image biomedical_twf_folder_open_16_n_p) | QIAseq Sample Analysis (Image qiaseqrna_folder_closed_16_n_p) | QIAseq DNA workflows (Image qiaseq_workflows_folder_closed_16_n_p) | Identify QIAseq Exome Causal Inherited Variants in Trio (Image qiaseq_causal_var9)

If you are connected to a CLC Server via the CLC Workbench, you will be asked where you would like to run the analysis. We recommend that you run the analysis on a CLC Server when possible.

Separate dialog steps are presented for providing the sample data for the proband, the affected parent and the unaffected parent. The names of the steps in the left hand side of the wizard indicate the data that should be entered in that step. For example, sequencing reads for the proband would be selected in the step shown in figure 13.50.

Image exome_causal_select_reads
Figure 13.50: The sequence reads for the proband are specified at the "Select reads from proband" wizard step.

The following dialog helps you set up the relevant Reference Data Set. If you have not downloaded the Reference Data Set yet, the dialog will suggest the relevant data set and offer the opportunity to download it using the Download to Workbench button. (See figure 13.51).

Image select_reference_exome_causaltrio
Figure 13.51: The relevant Reference Data Set is highlighted. In the text to the right, the types of reference needed by the workflow are listed.

Note that if you wish to Cancel or Resume the Download, you can close the template workflow and open the Reference Data Manager where the Cancel, Pause and Resume buttons are available.

If the Reference Data Set was previously downloaded, the option "Use the default reference data" is available and will ensure the relevant data set is used. You can always check the "Select a reference set to use" option to be able to specify another Reference Data Set than the one suggested.

Both Map Reads to Reference and Fixed Ploidy Variant Detection are configured in separate dialog steps for the proband, the affected parent and the unaffected parent samples. The names of the steps in the left hand side of the wizard, and near the top of each dialog, indicate which sample the parameters apply to. For example, step 9 of the wizard shown in figure 13.51 includes the word "proband". This wizard step, displayed in figure 13.52, has the word "proband" in the title near the top of the dialog.

Image exome_causal_fixedploidy
Figure 13.52: Configuration of Fixed Ploidy Variant Detection tool for the proband sample analysis. This tool is configured separately for the anlaysis of each family member.

In the Map Reads to Reference dialog, it is possible to configure masking. A custom masking track can be used, but by default, the masking track is set to GenomeReferenceConsortium_masking_hg38_no_alt_analysis_set, containing the regions defined by the Genome Reference Consortium, which serve primarily to remove false duplications, including one affecting the gene U2AF1. Changing the masking mode from "No masking" to "Exclude annotated" excludes these regions.

The Fixed Ploidy Variant Detection settings:

In the next two wizard steps, individual filtering settings can be specified for SNVs and Indels for the proband.

In the final wizard step, choose to Save the results of the workflow and specify a location in the Navigation Area before clicking Finish.

Launching using the QIAseq Panel Analysis Assistant

The workflow is also available in the QIAseq Panel Analysis Assistant under Exome.



Subsections