Fusion detection

Fusion genes are detected by:

Image dna_fusion_unaligned_end_mapped
Figure 31.55: Top: Track lists showing the reference sequence, gene track, input reads track, output unaligned ends, and fusion genes track. Left: A read is mapped to Gene1 on chr1. Gene1 is annotated on the plus strand. The read is mapped in the forward direction (illustrated in green) and contains an unaligned end, resulting in a breakpoint at the 5' side. Right: The unaligned end maps to Gene3 on chr2. Gene3 is annotated on the plus strand and the unaligned end is mapped in the forward direction (illustrated in green). The corresponding breakpoint is at the 3' side, placed at the start of the mapped region, which corresponds to the beginning of Gene3. Bottom: Table view of the fusion genes track, showing the two breakpoints.

Whether a breakpoint is considered at the beginning or end of a gene depends on the gene's strand (figures 31.55 and 31.56).

Image dna_fusion_unaligned_end_mapped_forward_reverse
Figure 31.56: Top: Track lists showing the reference sequence, gene track, input reads track, output unaligned ends, and fusion genes track. Left: A read is mapped to Gene4 on chr1. Gene4 is annotated on the plus strand. The read is mapped in the forward direction (illustrated in green) and contains an unaligned end, resulting in a breakpoint at the 5' side. Right: The unaligned end maps to Gene5 on chr2. Gene5 is annotated on the minus strand and the unaligned end is mapped in the reverse direction (illustrated in red). The corresponding breakpoint is at the 3' side, placed at the end of the mapped region, which corresponds to the beginning of Gene5. Bottom: Table view of the fusion genes track, showing the two breakpoints.

Breakpoints can be located outside the gene, up to the the nearest gene along the same strand (figure 31.57):

Image dna_fusion_unaligned_end_mapped_extend
Figure 31.57: Top: Track lists showing the reference sequence, gene track, input reads track, output unaligned ends, and fusion genes track. Left: A read is mapped in the downstream region of Gene6. Gene6 is annotated on the plus strand. The read is mapped in the forward direction (illustrated in green) and contains an unaligned end, resulting in a breakpoint at the 5' side. Right: The unaligned end maps in the upstream region of Gene9 on chr2. Gene9 is annotated on the plus strand and the unaligned end is mapped in the forward direction (illustrated in green). The corresponding breakpoint is at the 3' side, placed at the start of the mapped region, which corresponds to the beginning of Gene9. Bottom: Table view of the fusion genes track, showing the two breakpoints with a non-zero 'Gene distance' and an intergenic 'Breakpoint position'.

The mapping direction of the read and its unaligned end can be on the same or opposite strand of the corresponding gene:

Image dna_fusion_unaligned_end_mapped_switch_of_direction
Figure 31.58: Top: Track lists showing the reference sequence, gene track, input reads track, output unaligned ends, and fusion genes track. Left: Two reads are mapped to Gene10 on chr1. Gene10 is annotated on the plus strand. The reads are mapped in the forward (illustrated in green) and reverse (illustrated in red) directions, respectively, and contain unaligned ends, resulting in a breakpoint at the 5' side. Right: The unaligned ends map to Gene12 on chr2. Gene12 is annotated on the plus strand and the unaligned ends are mapped in the reverse (illustrated in red) and forward (illustrated in green) directions, respectively. The corresponding breakpoint is at the 3' side, placed at the start of the mapped region, which corresponds to the beginning of Gene12. Bottom: Table view of the fusion genes track, showing the two breakpoints and that the reads changed strand orientation in 'Reversal'.