Import Peak Count Matrix

Several formats can be imported into a Peak Count Matrix (

) using the following importers:

Cell Ranger HDF5: Import Peak Count Matrix in Cell Ranger HDF5 format ();
MEX: Import Peak Count Matrix in MEX format ();

The importers can be found here:

Import () | Single Cell Data () | Import Peak Count Matrix ()

The importers can be used in workflows. See On-the-fly import in workflows for details.

General options

Image atac_import_general_options
Figure 4.9: The Cell Ranger Peak Count importer. The General options are common to all the peak matrix importers.

The following options are common to all peak matrix importers (figures 4.9 and 4.10):

Gene track Positions in the imported data are matched with the provided track.
Matching is used to:
- View the Peak Count Matrix as a Track. For more information on tracks, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Tracks.html.
- Identify nearby genes if these are not explicitly supplied.
Cell format and Sample: How cells are identified. See Cell format in importers for more details.

Figure 4.10: Additional options common to all the peak matrix importers.
Nearby genes Nearby genes are determined in one of two ways:
- By searching for nearby genes using the Gene track and an accompanying mRNA track
- By supplying nearby genes in a selected tab-separated file (.tsv).
The file must consist of either:
- 6 columns: name of the chromosome prefixed with "chr" (e.g., "chr1"), start and end position of the peak, the name of the gene, distance and type of peak.
- 4 columns: name of the chromosome together with start and end positions of the peak (e.g., "chr1:123-456"), the name of the gene, distance and type of peak.
The first line must be column headers.
The distance is the number of base positions from the start or end of the peak to the start or end of the gene, whichever is closest. It is signed and will be negative if the peak is before the gene.
The type of the peak can be either "promoter" or "distal". Other values are ignored.
If there are multiple nearby genes per peak, they can either be on separate lines or be grouped on one line, with gene name, distance and peak types separated by semi-colon.
Transcription factors If enabled, transcription factors will be imported from the selected tab-separated bed file. Each line consists of the name of the chromosome (e.g., "chr1"), start and end positions of the peak, and the name of the transcription factor.
If not enabled, the peak matrix will not have transcription factors.

The data to be imported may either consist of peak data only or it may be a mixture of peaks and gene expressions, as is the case for 10x Multiome files. In the latter case, the gene expressions must be imported into a separate Expression Matrix (see Import Expression Matrix).

HDF5 importer

The Cell Ranger HDF5 importer requires one file to be supplied:

Peak count matrix The peak count matrix in HDF5 format.

MEX importer

The MEX importer requires three files to be supplied:

Barcodes file A file with the extension .tsv and tab-separated columns, with one row per barcode. It can optionally contain a header. The barcodes are read from the first column. Empty lines are ignored.
Use the Cell format option to control how the barcodes should be interpreted - for example if it also includes information about the sample.
Feature or peak file This should be one of:
- A feature file with extension .tsv and six tab-separated columns, with one row per feature or peak. These are relevant for mixtures of peaks and expressions, e.g. 10x Multiome. The columns are: identifier (e.g., "chr1:123-456"); name (same as identifier for peaks); type, e.g. "Gene Expression" or "Peaks"; chromosome (e.g., "chr1"); start and end position of the feature or peak. The file can optionally contain a header. Empty lines are ignored.
- A peak file with extension .bed and three tab-separated columns: the chromosome, start and end position.
Matrix file A file containing the expression with the extension .mtx in the Matrix Market Exchange Coordinate Format.

See https://math.nist.gov/MatrixMarket/formats.html for details of the Matrix Market Exchange Coordinate Format.

Browse the manual

Import Peak Count Matrix

General options

HDF5 importer

MEX importer