Merging of clonotypes

The three merging options can be used together.

The following example uses both Merge clonotypes with similar CDR3 and Merge clonotypes without C segment. Consider two clonotypes with the same identified V and J segments and sufficiently similar CDR3 nucleotide sequences, only one of which has an identified C segment:

Multiple clonotypes with similar CDR3 sequences can be merged in different ways, depending on the values set for the different options. Consider the example from figure 7.9:

Figure 7.9: Three clonotypes sharing the identified segments and with similar CDR3 sequences. Top: Clonotypes table view. Bottom: Clonotypes alignment view for all three clonotypes, highlighting the differences in the CDR3 sequence.

Consecutive merges

Clonotype merging is performed consecutively, so multiple clonotypes may be merged into one. With Minimum count ratio set to 1.5 and Maximum errors set to 1:

Merges into multiple clonotypes

A clonotype may be merged into multiple clonotypes. With Minimum count ratio set to 4 and Maximum errors set to 2, clonotype 3 is merged into both clonotypes 1 and 2, and its count is distributed between them in proportion to their respective counts:

Clonotype 2 is not merged into clonotype 1 because the ratio of their counts, 26 / 13 = 2, is below the Minimum count ratio of 4.
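
As a rough illustration of the arithmetic in this example, the following Python sketch checks a count ratio and splits a count proportionally. It is not the tool's implementation. The function names are hypothetical, the comparison is assumed to be "ratio at least Minimum count ratio", and any rounding the tool may apply is not modeled. The counts 26 and 13 for clonotypes 1 and 2 are taken from the comparison above. The count of 3 used for clonotype 3 is a made-up value chosen only for illustration.

```python
def passes_count_ratio(larger_count, smaller_count, min_count_ratio):
    """Return True if larger_count / smaller_count reaches min_count_ratio.

    Assumption: the threshold is inclusive (>=); the manual only shows
    a case where the ratio is strictly below the threshold.
    """
    return larger_count / smaller_count >= min_count_ratio


def distribute_count(count, target_counts):
    """Split `count` between targets in proportion to their own counts.

    `target_counts` maps a clonotype label to its count. The result maps
    each label to the share of `count` it receives (as a float).
    """
    total = sum(target_counts.values())
    return {name: count * c / total for name, c in target_counts.items()}


# Clonotype 2 (count 13) versus clonotype 1 (count 26), Minimum count ratio 4.
merge_2_into_1 = passes_count_ratio(26, 13, min_count_ratio=4)

# Hypothetical count of 3 for clonotype 3, split between clonotypes 1 and 2.
shares = distribute_count(3, {"clonotype 1": 26, "clonotype 2": 13})

print(merge_2_into_1)
print(shares)
```

With these inputs the ratio check fails, since 26 / 13 = 2, and clonotype 1 receives twice the share of clonotype 2, because its count is twice as large.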

Identical due to preceding merging

Two originally different clonotypes can become identical as a result of preceding merges, in which case they are merged into one. Consider the example from figure 7.10:

Figure 7.10: Three clonotypes sharing the identified segments, with similar CDR3 sequences, and where one is missing the C segment. Top: Clonotypes table view. Bottom: Clonotypes alignment view for all three clonotypes, highlighting the differences in the CDR3 sequence and the shorter reads not covering the C segment.

Running the tool with both Merge clonotypes with similar CDR3 and Merge clonotypes without C segment enabled, with Minimum count ratio set to 2, Maximum errors set to 1 and Minimum count for clonotypes with C segment set to 10: