The Identify QIAseq Exome Causal Inherited Variants in Trio template workflow identifies putative disease causing, inherited variants in a family of three, where there is an affected parent, unaffected parent and a proband. It does this by creating a list of variants present in both affected individuals and subtracting all variants in the unaffected individual. Additional checks are then carried out, allowing variants to be classified, for example as de novo variants, recessive variants, etc.
The Identify QIAseq Exome Causal Inherited Variants in Trio template workflow can be found at:
Template Workflows | Biomedical Workflows () | QIAseq Sample Analysis () | QIAseq DNA workflows () | Identify QIAseq Exome Causal Inherited Variants in Trio ()
If you are connected to a CLC Server via the CLC Workbench, you will be asked where you would like to run the analysis. We recommend that you run the analysis on a CLC Server when possible.
Separate dialog steps are presented for providing the sample data for the proband, the affected parent and the unaffected parent. The names of the steps in the left hand side of the wizard indicate the data that should be entered in that step. For example, sequencing reads for the proband would be selected in the step shown in figure 13.50.
The following dialog helps you set up the relevant Reference Data Set. If you have not downloaded the Reference Data Set yet, the dialog will suggest the relevant data set and offer the opportunity to download it using the Download to Workbench button. (See figure 13.51).
Note that if you wish to Cancel or Resume the Download, you can close the template workflow and open the Reference Data Manager where the Cancel, Pause and Resume buttons are available.
If the Reference Data Set was previously downloaded, the option "Use the default reference data" is available and will ensure the relevant data set is used. You can always check the "Select a reference set to use" option to be able to specify another Reference Data Set than the one suggested.
Both Map Reads to Reference and Fixed Ploidy Variant Detection are configured in separate dialog steps for the proband, the affected parent and the unaffected parent samples. The names of the steps in the left hand side of the wizard, and near the top of each dialog, indicate which sample the parameters apply to. For example, step 9 of the wizard shown in figure 13.51 includes the word "proband". This wizard step, displayed in figure 13.52, has the word "proband" in the title near the top of the dialog.
In the Map Reads to Reference dialog, it is possible to configure masking. A custom masking track can be used, but by default, the masking track is set to GenomeReferenceConsortium_masking_hg38_no_alt_analysis_set, containing the regions defined by the Genome Reference Consortium, which serve primarily to remove false duplications, including one affecting the gene U2AF1. Changing the masking mode from "No masking" to "Exclude annotated" excludes these regions.
The Fixed Ploidy Variant Detection settings:
- Minimum coverage: Only variants in regions covered by at least this many reads are called.
- Minimum count: Only variants that are present in at least this many reads are called.
- Minimum frequency: Only variants that are present at least at the specified frequency (calculated as 'count'/'coverage') are called.
In the next two wizard steps, individual filtering settings can be specified for SNVs and Indels for the proband.
In the final wizard step, choose to Save the results of the workflow and specify a location in the Navigation Area before clicking Finish.
The workflow is also available in the QIAseq Panel Analysis Assistant (see QIAseq Panel Analysis Assistant) under Exome.