UMI grouping

All of the LigthSpeed tools can group reads based on Unique Molecular Identifiers (UMIs).

The UMI sequence is recorded and removed from the reads before trimming and mapping. After the reads have been mapped, reads with similar UMI sequence and mapping position are merged into a consensus UMI read.

The consensus is calculated following these rules:

The following options can be used to adjust how raw reads are grouped into UMI reads:

Note that the maximum number of reads used for creating a UMI consensus read is 20,000. Therefore, UMI groups with more than 20,000 reads will be merged into more than one consensus UMI read.