Analyze QIAseq Somatic WGS
The Analyze QIAseq Somatic WGS (Illumina) template workflow is designed to call somatic variants from DNA data produced with the library kits QIAseq Multimodal DNA/RNA Library Kit (without UMIs), QIAseq FX DNA Library Kit, and QIAseq Ultralow Input Library Kit.
The workflow includes all necessary steps for processing and analyzing the reads:
- Various statistics summarizing and visualizing the input reads are produced using QC for Sequencing Reads.
- Reads are trimmed using Trim Reads.
- Reads are mapped to the human reference genome using Map Reads to Reference.
- A guidance track is generated from the mapped reads using Structural Variant Caller.
- An improved mapping is obtained by realigning the mapped reads using the guidance track and Local Realignment.
- Variants are detected from the improved mapping using Low Frequency Variant Detection.
- The variants are annotated with various information, such as overlap with repeat/homopolymer regions and genes, and are subsequently filtered to remove likely artifacts using Filter on Custom Criteria.
- A summary report is created using Create Sample Report.
- A report is optionally prepared for uploading to QCI Interpret using Prepare QCI Interpret Upload.
Launching the workflow
To run this workflow, go to:
Workflows | Template Workflows | Biomedical Workflows (
) | QIAseq Sample Analysis (
) | QIAseq DNA Workflows (
) | Analyze QIAseq Somatic WGS (Illumina) (
)
See Launching workflows individually and in batches for general information.
Options can be configured in the following wizard steps:
- Choose where to run. If you are connected to a CLC Server via the CLC Workbench, you will be asked where you would like to run the analysis. We recommend that you run the analysis on a CLC Server when possible.
- Specify workflow path. The following path can be configured:
- Prepare for QCI Interpret. Prepare a report that can later be uploaded to QCI Interpret or QCI Interpret Translational.
- Select Reads. Select the input reads. When analyzing more than one sample at a time, check Batch in the lower left corner of the dialog.
- Specify reference data handling. Select the QIAseq DNA Hybrid Capture and WGS hg38 Reference Data Set, see Reference data management for details.
- Configure batching, if running the workflow in batch mode. Define the batch units.
- Batch overview, if running the workflow in batch mode. Verify that the batching is as intended.
- Map Reads to Reference. Configure masking. By default, regions defined by the Genome Reference Consortium are excluded to remove false duplications, including one involving U2AF1.
- Filter on Custom Criteria. Configure how variants should be filtered. The default options are optimized for samples with relatively high quality and coverage providing a strong balance between sensitivity and precision. For lower-quality or lower-coverage samples, when seeking a different sensitivity-precision balance, or for detecting low-frequency variants, additional filters or adjusted thresholds might be needed. See Filter on Custom Criteria for details on how to configure the options.
- Prepare QCI Interpret Upload, if preparing a QCI Interpret report. Specify sample name, subject ID, and project.
- Result handling. Choose if a workflow result metadata and/or log should be saved.
- Save location for new elements. Choose where to save the data, and press Finish to start the analysis.
Launching using the QIAseq Panel Analysis Assistant
The workflow is also available in the QIAseq Panel Analysis Assistant under DNA/RNA Library Kit.
Subsections
