Doublets filter

Certain single-cell protocols can assign the same barcode to two or more cells. For example, in droplet-based data, each droplet has a unique barcode, but droplets can contain more than one cell. For data obtained using combinatorial barcoding, there is a chance that two cells will travel together through all barcoding rounds, creating a doublet.

In this dialog of QC for Single Cell, the Doublets filter can be enabled and customized to remove the barcodes that are detected as being assigned to two cells. This filter should be skipped for single-cell protocols that do not generate doublets.

Note that QC for Single Cell cannot remove barcodes that are assigned to more than two cells. However, these are expected to be present at negligible rates.

There are two types of doublets:

Doublet-removal software, which relies on gene expression to detect doublets, cannot identify homotypic doublets, as their expression profiles are indistinguishable from those of other cells. Alternative approaches are required to detect homotypic doublets, such as cell hashing [Stoeckius et al., 2018] and SNPs in multiplexed samples [Kang et al., 2018].

The Doublets filter simulates heterotypic doublets by averaging the expression of two random barcodes that are sufficiently different from each other. These artifical doublets are then used for predicting which of the input barcodes are doublets.

Image qc-doublets
Figure 7.8: The default settings in the Doublets filter dialog.

The following options can be adjusted in the Doublets filter dialog (figure 7.8):

Note: Expected doublets (%) is relative to the number of captured cells, with estimates dependent on the platform:

  • 10x Geonomics. The number of barcodes passing the Empty droplets filter.
  • Parse Biosciences. The number of barcodes passing the Number of reads filter.
If the above estimates are not appropriate, it is recommended to check Specify expected doublets and set the options based on the number of the experiment's target capture cells. For example, if 5000 cells were targeted using 10x Genomics, Expected doublets (%) should be set to 5% and Correction margin (%) to 2.5%.

The Doublets filter receives as input only the high quality cells that pass all filters.

For more details on how doublets are detected, see Doublet calling.