Compatibility with CLC Genomics Workbench tools
Read mappings produced by the tools LightSpeed Fastq to Germline Variants, LightSpeed Fastq to Somatic Variants and LightSpeed Fastq to Somatic Variants Tumor Normal differ in some fundamental ways from read mappings produced by the Map Reads to Reference tool, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Map_Reads_Reference.html. The main differences are related to how base quality scores and unaligned ends are handled:
- Base quality scores The Lightspeed tools access and use the base quality scores during variant calling, but for efficiency reasons the base quality scores on the individual bases are not kept and stored on individual bases in the final read mapping. CLC Genomics Workbench tools that rely on quality scores being available on bases in a read mapping (such as the variant callers Fixed Ploidy Variant Detection, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Fixed_Ploidy_Variant_Detection.html and Low Frequency Variant Detection, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Low_Frequency_Variant_Detection.html) cannot be expected to perform as well on a read mapping produced by a Lightspeed tool as on a read mapping produced with the Map Reads to Reference tool available in CLC Genomics Workbench.
- Unaligned ends The Lightspeed tools include a structural variant detection step and a subsequent realignment step. In the structural variant detection step, structural variants are inferred from unaligned ends, and the realignment step uses the inferred structural variants during realignment. The realignment procedure differs from that used by the Local Realignment tool, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Local_Realignment.html (available in CLC Genomics Workbench) in that unaligned ends that equally well support different alleles are kept unaligned. CLC Genomics Workbench tools that rely on unaligned ends in a read mapping (such as the InDels and Structural Variants tool, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=InDels_Structural_Variants.html) will therefore behave differently on a read mapping produced with Lightspeed tools compared to a read mapping produced with the Map Reads to Reference tool.