Resolve repeats with conflicts
In the previous section, repeats were resolved without excluding any of the reads that go through the window. While this leads to a simpler graph, the graph will still contain artifacts, which have to be removed. The next phase removes most of these errors and is similar to the previous phase:
- A node is selected as the initial window.
- The border is divided into sets using reads going through the window. If we have multiple sets, the repeat is resolved.
- If the repeat cannot be resolved, the border nodes are divided into sets using reads going through the window where reads containing errors are excluded. If we have multiple sets, the repeat is resolved.
- The window is expanded with nodes if possible and step 2 is repeated.
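The "divide the border into sets" step can be sketched as a connected-components computation. This is only an illustration under stated assumptions: it assumes two border nodes belong to the same set whenever a read connects them through the window (directly or through a chain of such reads), and the names `border_sets` and `is_resolved` are hypothetical, not taken from the assembler.

```python
from collections import defaultdict


def border_sets(border_nodes, read_pairs):
    """Group border nodes into sets.

    Assumption: two border nodes share a set when some read going through
    the window connects them, directly or via other border nodes.
    `read_pairs` holds one (node, node) tuple per connecting read.
    """
    parent = {node: node for node in border_nodes}

    def find(node):
        # Follow parent links to the set representative (with path halving).
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    for a, b in read_pairs:
        parent[find(a)] = find(b)  # merge the two sets

    groups = defaultdict(set)
    for node in border_nodes:
        groups[find(node)].add(node)
    return sorted(sorted(group) for group in groups.values())


def is_resolved(border_nodes, read_pairs):
    # The repeat counts as resolved when the border splits into several sets.
    return len(border_sets(border_nodes, read_pairs)) > 1


clean = [("A", "C"), ("B", "D")]
conflicting = clean + [("B", "C")]  # one extra, possibly erroneous, read
print(border_sets("ABCD", clean))
print(is_resolved("ABCD", conflicting))
```

In this sketch a single stray read between B and C is enough to merge everything into one set, which is why excluding reads with errors matters before the sets are computed.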
The algorithm described above is similar to the one used in the previous section, except for step 3, where reads with errors are excluded. This is done by calculating an average $avg_1 = m_1 / n_1$, where $m_1$ is the number of reads going through the window and $n_1$ is the number of distinct pairs of border nodes having one (or more) of these reads connecting them. A second average $avg_2 = m_2 / n_2$ is calculated, where $m_2$ is the number of reads going through the window having $avg_1$ or more reads connecting their border nodes and $n_2$ is the number of distinct pairs of border nodes having $avg_1$ or more reads connecting them. Then, a read between two border nodes B and C is excluded if the number of reads going through B and C is less than or equal to $limit$, given by

$\displaystyle limit = \frac{\log(avg_2)}{2} + \frac{avg_2}{16}$

For example, with 21 reads going through the window across 3 pairs of border nodes, 20 of which belong to the 2 pairs that have 7 or more reads:

$avg_1 = 21 / 3 = 7$
$avg_2 = 20 / 2 = 10$
$limit = 1/2 + 10/16 = 1.125$

These values correspond to a base-10 logarithm, since $\log(10)/2 = 1/2$.
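The exclusion rule can be checked numerically with a small sketch. It makes several assumptions: the logarithm is base 10 (which is what the worked values above imply), the per-pair read counts 10, 10 and 1 are a hypothetical split chosen only to match the totals 21 and 20, and the function names and the variables `m1`, `n1`, `m2`, `n2` are illustrative rather than part of the assembler.

```python
import math


def exclusion_limit(pair_counts):
    """Return (avg1, avg2, limit) for a window.

    `pair_counts` maps each pair of border nodes to the number of reads
    connecting that pair through the window (every count is >= 1).
    """
    m1 = sum(pair_counts.values())  # reads going through the window
    n1 = len(pair_counts)           # connected border-node pairs
    avg1 = m1 / n1

    # Keep only the pairs supported by avg1 or more reads.
    strong = [count for count in pair_counts.values() if count >= avg1]
    avg2 = sum(strong) / len(strong)

    # Assumption: base-10 logarithm, as implied by the worked example.
    limit = math.log10(avg2) / 2 + avg2 / 16
    return avg1, avg2, limit


def keep_supported_pairs(pair_counts):
    # Reads between a pair are excluded when their count is <= limit.
    _, _, limit = exclusion_limit(pair_counts)
    return {pair: n for pair, n in pair_counts.items() if n > limit}


# Hypothetical counts: 21 reads in total, 20 of them on two strong pairs.
counts = {("A", "C"): 10, ("B", "D"): 10, ("B", "C"): 1}
print(exclusion_limit(counts))
print(keep_supported_pairs(counts))
```

With these assumed counts, the single read between B and C falls below the limit and is dropped, so the remaining reads connect only A with C and B with D.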
Figure 28.9: A repeat with conflicts.
Figure 28.10: Resolving a repeat with conflicts.