Structural variant detection

The LightSpeed Fastq to Germline Variants and LightSpeed Fastq to Somatic Variants tools can infer tandem duplications and inversions from unaligned ends during the realignment step.

This works by

  1. Identifying breakpoints where multiple reads share a common unaligned end at the same position.
  2. Aligning the sequence of identified breakpoints (unaligned end and upstream sequence) to each other and the reference sequence up or downstream of the breakpoints to find likely matches.

Tandem duplications can be detected from pairs of breakpoints that are within 1000 base pairs of each other.

Inversions can be detected from pairs of breakpoint on the same chromosome. The tool only reports the longest possible inversion when multiple breakpoints support similar inversions.

Default inversion detection requires breakpoint support from both sides of the breakpoint, but the option Lenient inversion detection allows detection of inversions where each breakpoint is only supported by reads from one side of the breakpoint. Lenient inversion detection can be relevant when analyzing targeted data. Enabling lenient variant detection can lead to detection of more false positive inversions, and is also likely to increase the processing time.

Tandem duplications are reported in the variant track. Tandem duplications that are inferred fom unaligned ends are annotated with Yes in the variant track column Inferred from unaligned ends. Identified inversions are reported in the inversions track.