Identify Somatic Variants from Tumor Normal Pair (TAS)

The Identify Somatic Variants from Tumor Normal Pair (TAS) ready-to-use workflow can be used to identify potential somatic variants in a tumor sample when you also have a normal/control sample from the same individual.

When you run this workflow, the reads are mapped and variants are identified. An internal workflow then removes germline variants, that is, variants that are also found in the mapped reads of the normal/control sample. Variants outside the target regions are removed as well, as they are likely to be false positives caused by non-specific mapping of sequencing reads. Next, the remaining variants are annotated with gene names, amino acid changes, conservation scores, and information from relevant databases such as ClinVar (variants with clinically relevant associations). Finally, information from dbSNP is added to show which of the detected variants have been observed before and which are new.
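The two filtering steps described above can be illustrated with a minimal Python sketch. This is not the workbench's implementation: the `Variant` class, the `candidate_somatic` function, and the representation of target regions as 1-based inclusive intervals are all assumptions made for illustration, and the set of alleles seen in the normal sample stands in for the read-level check that the workflow performs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    """Hypothetical minimal variant record (not a workbench data type)."""
    chrom: str
    pos: int  # 1-based position
    ref: str
    alt: str


def in_targets(variant, targets):
    """Return True if the variant lies inside any target interval.

    targets maps a chromosome name to a list of (start, end) pairs,
    assumed here to be 1-based and inclusive.
    """
    return any(start <= variant.pos <= end
               for start, end in targets.get(variant.chrom, []))


def candidate_somatic(tumor_variants, normal_alleles, targets):
    """Keep tumor variants that are absent from the normal sample
    and that fall inside the target regions."""
    return [v for v in tumor_variants
            if v not in normal_alleles and in_targets(v, targets)]


tumor = [
    Variant("chr1", 100, "A", "T"),  # on target, tumor only
    Variant("chr1", 150, "G", "C"),  # also seen in normal: germline
    Variant("chr2", 500, "C", "G"),  # off target
]
normal_alleles = {Variant("chr1", 150, "G", "C")}
targets = {"chr1": [(90, 200)]}

result = candidate_somatic(tumor, normal_alleles, targets)
```

In this toy example only the first variant remains, because the second is shared with the normal sample and the third lies outside the target regions.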

Before starting the workflow, you will need to import into the workbench a file with the genomic regions targeted by the amplicon or hybridization kit. Such a file (a BED or GFF file) is usually available from the vendor of the enrichment kit and sequencing machine. Use the Import | Tracks tool to import it into your Navigation Area.
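If you want to sanity-check a vendor BED file before importing it, a small script run outside the workbench can help. The following is a minimal sketch and not a workbench feature: the function names are hypothetical, and it reads only the first three BED columns (chromosome, 0-based start, end exclusive), skipping comment, `track`, and `browser` lines.

```python
def read_bed_regions(lines):
    """Parse BED lines into (chrom, start, end) tuples.

    BED coordinates are 0-based with an exclusive end, so a region
    covers end - start bases.
    """
    regions = []
    for line_number, line in enumerate(lines, start=1):
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split()
        if fields[0] in ("track", "browser"):
            continue  # header lines, not regions
        if len(fields) < 3:
            raise ValueError(f"line {line_number}: expected at least 3 columns")
        chrom, start, end = fields[0], int(fields[1]), int(fields[2])
        if start < 0 or end <= start:
            raise ValueError(f"line {line_number}: invalid interval {start}-{end}")
        regions.append((chrom, start, end))
    return regions


def total_target_bases(regions):
    """Sum the region lengths (overlapping regions are counted twice)."""
    return sum(end - start for _, start, end in regions)


example = [
    "track name=targets",
    "chr1\t100\t200\tamplicon_1",
    "",
    "chr2\t0\t50",
]
regions = read_bed_regions(example)
total = total_target_bases(regions)
```

To check a real file, pass an open file handle instead of the example list, for instance `read_bed_regions(open("targets.bed"))`, where `targets.bed` is a placeholder file name.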

Run the Identify Somatic Variants from Tumor Normal Pair (TAS) workflow

To run the Identify Somatic Variants from Tumor Normal Pair (TAS) workflow, go to:

        Toolbox | Ready-to-Use Workflows | Targeted Amplicon Sequencing | Somatic Cancer | Identify Somatic Variants from Tumor Normal Pair (TAS)

  1. Go to the toolbox and double-click on the Identify Somatic Variants from Tumor Normal Pair (TAS) ready-to-use workflow.

  2. First, select the tumor sample reads (figure 22.21).

    Image filter_somatic_variants_from_tumor_normal_step1_tas
    Figure 22.21: Select the tumor sample reads.

  3. In the next wizard step, specify the normal sample reads.

  4. In the next dialog, select which data set should be used to identify variants (figure 22.22).

    Image identify_somatic_variants_tas
    Figure 22.22: Choose the relevant reference Data Set to identify variants.

  5. The following two wizard steps allow you to restrict the calling of Indels and Structural Variants to the targeted regions, for both the tumor and the normal reads (figure 22.23).

    Image filter_somatic_variants_from_tumor_normal_step3_tas
    Figure 22.23: Specify the target regions track.

  6. Set the parameters for the Low Frequency Variant Detection step (figure 22.24).

    Image filter_somatic_variants_from_tumor_normal_step5_tas
    Figure 22.24: Specify the settings for the variant detection.

    For a description of the different parameters that can be adjusted, see http://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Low_Frequency_Variant_Detection.html. If you click on "Locked Settings", you will be able to see all parameters used for variant detection in the ready-to-use workflow.

  7. In the following two wizard steps, select the target regions track to be used for reporting the performance of the targeted re-sequencing experiment, for the tumor and the normal samples in turn (figure 22.25). The target regions track should be the same as the track you selected in the previous wizard steps. Variants found outside the targeted regions will not be included in the output generated by the ready-to-use workflow.

    Image filter_somatic_variants_from_tumor_normal_step4_tas
    Figure 22.25: Select your target region track.

    For a description of the different parameters that can be adjusted, see http://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=QC_Targeted_Sequencing.html. If you click on "Locked Settings", you will be able to see all parameters used for the QC for Targeted Sequencing tool in the ready-to-use workflow.

  8. In the Remove Variants Present in Control Reads step, you can adjust the settings for removal of germline variants (figure 22.26).

    Image filter_somatic_variants_from_tumor_normal_step6_tas
    Figure 22.26: Specify settings for removal of germline variants.

  9. In the last wizard step, you can check the selected settings by clicking the button labeled Preview All Parameters. The Preview All Parameters wizard only lets you view the settings. To make changes, use the Previous button in the wizard to return to the relevant step and edit the parameters there.

  10. Choose to Save your results and click Finish.

Output from the Identify Somatic Variants from Tumor Normal Pair (TAS) workflow

The following outputs are generated:

Image identify_somatic_variants_genomebrowserview_tas
Figure 22.27: The Track List presents all the different data tracks together and makes it easy to compare different tracks.