Import Expression Matrix

The following expression matrix formats can be imported into an Expression Matrix (Image expression_matrix_track_16_n_p).

Some other commonly encountered formats are specific to a programming language or software package. These can usually be exported from that software package as Loom files. For example:

Options common to all importers

Several options are common to all expression matrix importers. Figure 2.1 shows the Cell Ranger HDF5 importer, which only contains these general options.

Image hdf5importer
Figure 2.1: The Cell Ranger HDF5 importer. The General options are common to all the expression matrix importers.

Options common to Loom and MEX importers

Loom and MEX formats can contain both the total expression, spliced, and unspliced counts. The importers can be configured which type of data to import and produce either an Expression Matrix (Image expression_matrix_track_16_n_p), or an Expression Matrix with spliced and unspliced counts (Image expr_matrix_spliced_unspliced_16_n_p).

Details specific to the Loom importer

Loom allows the exchange of data between different software packages.

A Loom file has an internal structure consisting of a main matrix, optional `layers' of the same size as the main matrix, row and column attributes (describing features and cells, respectively), and sparse graphs describing links between features or between cells. See https://linnarssonlab.org/loompy/format/index.html for details of the format.

The Loom importer expects the Loom format version 3.0.0 and imports the main matrix, row attributes describing feature names and feature identifiers, column attributes, and, if Import spliced/unspliced is ticked, the layers containing the spliced and unspliced counts. All other information in the Loom file is ignored.

Details specific to the MEX importer

The MEX importer requires at least two files to be supplied:

See additional details in the 'Options common to Loom and MEX importers' section on how the total expression is calculated.

See https://math.nist.gov/MatrixMarket/formats.html for details of the Matrix Market Exchange Coordinate Format.

Details specific to the MEX archive importer

The MEX archive importer is provided for convenience. It accepts a .zip, .tar or .tar.gz archive file containing the files required by the MEX importer. In order to uniquely identify each file, these must have a specific name:

The importer can be configured to either import an Expression Matrix (Image expression_matrix_track_16_n_p), or an Expression Matrix with spliced and unspliced counts (Image expr_matrix_spliced_unspliced_16_n_p). For the first option, 'Import expressions' must be enabled, while for the second option, 'Import spliced/unspliced' must be enabled.

Either the 'Matrix file', or 'Matrix file (spliced)' and Matrix file (unspliced)' can be missing from the archive, depending on how the importer has been configured.

See additional details in the 'Options common to Loom and MEX importers' section on how the total expression is calculated.

Details specific to the CSV importer

The CSV/TXT importer supports import of text data in a full table format.

Working with spreadsheets Be careful to check that all the data is present before import if the file originates from a spreadsheet program. Such programs often impose limits on the number of rows and columns.