Simplifying filter cascades

Prior to version 26.0 of Biomedical Genomics Analysis, DNA workflows performing variant detection typically relied on long filter cascades to remove likely false positive variants. These cascades consisted of multiple filter units, each containing:

After the filter units, the filter cascade typically also contained:

From version 26.0 of CLC Genomics Workbench, Filter on Custom Criteria supports complex filtering strategies, allowing workflows to be streamlined so that filtering can be performed in a single step.

This section uses a copy of the Identify QIAseq DNA Somatic Variants template workflow provided by version 25.0 of Biomedical Genomics Analysis to illustrate how such workflows can be updated.

Before making any changes, we recommend saving a copy of the workflow. This allows verifying that the updated workflow produces the same results as the original.

Identify filter units

Locate the start of the filter cascade and each of the filter units within it (figure 12.6). It can be useful to create a workflow group for each unit.

Image filter_cascade_units
Figure 12.6: The filter cascade from "Identify QIAseq DNA Somatic Variants (Illumina)" template workflow distributed with version 25.0. Each filter unit is placed in a collapsed workflow group. The units are followed by a "Filter on Custom Criteria" element and a "Filter Homozygous Reference Variants" element.

Add to the workflow a Filter on Custom Criteria element, referred to here as the consolidated filter step. This element will ultimately replace all filter units and will contain one filter group for each unit.

Incorporate filter units

For all Filter on Custom Criteria elements in the unit using Match all, or using Match any with only one criterion:

If the unit also contains Filter on Custom Criteria elements using Match any with more than one criterion:

When all Filter on Custom Criteria elements in the unit have been incorporated, we recommend collapsing the group.

Finalize the consolidated filter step

When all filter units have been incorporated:

Image consolidated_filter_step
Figure 12.9: The consolidated filter step has the entire filter cascade from figure 12.6 incorporated. The last group is expanded and corresponds to the last "Filter on Custom Criteria" element, which was not part of any unit and used "Match any".