Predicting Structural Variants

Having created breakpoint signatures (LBs and RBs), we use these in a procedure, which inspects the called breakpoint signatures, and attempts to match and combine them to infer possible underlying structural variant signatures. Based on the inferred structural variant signatures, structural variants are predicted, and annotations created. The procedure for inspecting breakpoint signatures and inferring structural variants is rather intricate and described in detail in Section 26.4.4 below. In short, the consensus sequences of the unaligned ends at the detected breakpoints are mapped to the reference sequences around other breakpoints as well as to the consensus sequence of other unaligned ends. A structural variant is predicted when the mapping patterns found for the unaligned ends correspond to one of the following 'structural variant signatures' described in Figures 26.22, 26.23, 26.24, 26.25, 26.26, 26.27, 26.28, 26.29 and 26.30.

Image deletion-crossmapped
Figure 26.22: A deletion with cross-mapping breakpoint evidence.

Image deletion-selfmapped
Figure 26.23: A deletion with selfmapping breakpoint evidence.

Image insertion-closebreakpoint
Figure 26.24: An insertion with close breakpoint evidence.

Image insertion-crossmapped
Figure 26.25: An insertion with cross-mapped breakpoints evidence.

Image insertion-selfmapped
Figure 26.26: An insertion with selfmapped breakpoint evidence.

Image insertion-tandem
Figure 26.27: An insertion with breakpoint mapping evidence corresponding to a 'Tandem duplication'.

Image inversion-crossmapped
Figure 26.28: The unaligned end mapping pattern of an inversion.

Image replacement
Figure 26.29: The unaligned end mapping pattern of a replacement.

Image translocation
Figure 26.30: The unaligned end mapping pattern of a translocation.

The 'Indels and Structural variants tool' has a number of outputs. The user may choose to have a report created. The report summarizes the number of breakpoints detected, provides some characteristics of the breakpoint, and on the numbers and types of structural variants detected. In addition to the report, the user may specify to have (1) the breakpoints, (2) the InDels and (3) the structural variants reported. These can be reported either as tracks or as tables, depending on the users choice. When the track option is chosen, the breakpoints and structural variants are reported in feature tracks, and the InDels in a variant track. The InDel track contains the small to medium sized insertions and deletions (shorter than approximately 220 bp) and for which the algorithm was able to identify the allele sequence (that is, the exact inserted sequence, or the exact deleted sequence).

Image standaloneVsTracks
Figure 26.31: Example of the result of an analysis on a standalone read mapping (to the left) and on a reads track (to the right).

Typically, there will be called breakpoint signatures that are not found to stem from a structural variant. There may be a number of reasons for that: (1) the unaligned ends from which the breakpoint signature was derived might not be caused by an underlying structural variant, but merely be due to read mapping issues or noise, or (2) the breakpoint(s) which the detected breakpoint should have been matched to was/were not detected, and therefore no matching breakpoint(s) were found. Breakpoints may go un-detected either because of lack of coverage in the breakpoint region or because they are located within regions with exclusively non-uniquely mapped reads (only unaligned ends of uniquely mapping reads are used).



Subsections