Map Long Reads to Reference

The Map Long Reads to Reference tool aligns long reads to a reference using minimap2 [Li, 2018].

To run the tool, go to:

        Tools | Resequencing Analysis | Map Long Reads to Reference

Select one or more sequence lists containing long reads.

In the References dialog, select one or more reference sequences. You can select individual sequences, a list of sequences, or a sequence track as the reference (figure 31.12).

Image map_long_reads_dialog_step1
Figure 31.12: Specifying the reference sequences and masking.

The next part of the dialog lets you mask the reference. Masking means that selected regions of the reference are ignored during read mapping. Reads will not be mapped to these regions, but the full reference is still included in the output.

Masking can be useful when reads are expected to originate only from specific regions, for example when working with targeted sequencing data. However, masking should be used with care. If reads originate outside the selected regions, they may be mapped to less suitable locations, which can affect downstream analyses such as variant detection.

Masking large numbers of regions, such as repetitive sequences, is generally not recommended. Repeats are handled automatically during mapping, and masking them may reduce performance and lead to incorrect read placement.
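The effect of masking can be pictured as splitting the reference into intervals that are open to mapping and intervals that are ignored. The following Python sketch is only a conceptual illustration and is not how the tool implements masking. The function names, the half-open 0-based coordinates, and the `exclude` flag (ignore the track regions, or map only to them) are assumptions made for the example.

```python
from typing import List, Tuple

# Hypothetical representation: half-open, 0-based intervals [start, end).
Interval = Tuple[int, int]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or touching intervals into a sorted, disjoint list."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def mappable_regions(ref_length: int, track: List[Interval],
                     exclude: bool = True) -> List[Interval]:
    """Return the reference intervals that remain open to read mapping.

    exclude=True  : track regions are masked, everything else is mappable.
    exclude=False : only the track regions are mappable (assumed mode,
                    e.g. for targeted sequencing).
    """
    # Clip regions to the reference and drop empty ones.
    clipped = [(max(0, s), min(ref_length, e)) for s, e in track if s < e]
    merged = merge_intervals([(s, e) for s, e in clipped if s < e])
    if not exclude:
        return merged

    # Complement of the masked regions within [0, ref_length).
    open_regions: List[Interval] = []
    cursor = 0
    for start, end in merged:
        if cursor < start:
            open_regions.append((cursor, start))
        cursor = max(cursor, end)
    if cursor < ref_length:
        open_regions.append((cursor, ref_length))
    return open_regions


if __name__ == "__main__":
    track = [(10, 20), (15, 30), (90, 120)]
    print(mappable_regions(100, track, exclude=True))
    print(mappable_regions(100, track, exclude=False))
```

In both modes the reference itself keeps its full length. Only the set of positions where reads may be placed changes, which is why reads originating from a masked region can end up at a less suitable location.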

To mask a reference using regions defined in a masking track, choose the appropriate masking option, then click the Browse button to select a track for masking.

If your regions are stored as sequence annotations, they can be converted to a track.

In the Mapping options dialog, set the read mapping parameters (figure 31.13):

Image map_long_reads_dialog_step2
Figure 31.13: Map Long Reads to Reference mapping options.

For guidance on adjusting match cost parameters, see Mapping parameters. Additional information on the parameters is available in the minimap2 documentation.
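As a rough aid for reasoning about the cost parameters, the sketch below scores an alignment using the two-piece affine gap cost described in the minimap2 manual, where a gap of length k costs min(O1 + k*E1, O2 + k*E2). The function names are hypothetical. The numeric values mirror the general defaults listed in the minimap2 manual (match 2, mismatch 4, gap open 4 and 24, gap extension 2 and 1) and are used here for illustration only, so the defaults shown in the dialog may differ.

```python
from typing import Iterable


def gap_cost(length: int, open1: int = 4, ext1: int = 2,
             open2: int = 24, ext2: int = 1) -> int:
    """Two-piece affine gap cost: the cheaper of a 'short gap' model
    (open1, ext1) and a 'long gap' model (open2, ext2)."""
    if length <= 0:
        return 0
    return min(open1 + length * ext1, open2 + length * ext2)


def alignment_score(matches: int, mismatches: int,
                    gap_lengths: Iterable[int],
                    match_score: int = 2, mismatch_cost: int = 4) -> int:
    """Score = matches * match_score - mismatches * mismatch_cost - gap costs.

    Illustrative values only; not necessarily the tool's defaults.
    """
    gaps = sum(gap_cost(g) for g in gap_lengths)
    return matches * match_score - mismatches * mismatch_cost - gaps


if __name__ == "__main__":
    for k in (1, 20, 50):
        print(k, gap_cost(k))
    print(alignment_score(matches=100, mismatches=2, gap_lengths=[1, 50]))
```

With these illustrative values the two gap models cost the same at a gap length of 20. Shorter gaps are priced by the first model, and longer gaps by the second, which makes long insertions and deletions relatively cheaper per base.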


