Import Cell Clusters
The Import Cell Clusters tool can import clusters for each cell. The importer produces Cell Clusters () that can be used to define groups of cells for use in many tools such as Differential Expression for Single Cell. Cell Clusters can also be visualized in a Dimensionality Reduction Plot.
Often the same file can be imported as either Cell Clusters or Cell Annotations. The principal advantage of Cell Clusters is that they can be edited within the Dimensionality Reduction Plot.
The importer can be found here:
Import () | Import Cell Clusters ().
The following options are available:
- Data file. A single file in .csv, .tsv or .xlsx format. The first row in the file is used as the name of the columns of the imported table. Each subsequent row in the file should describe a cell (though empty lines are ignored).
- First column defines sample. When this is enabled, the first column in the file will be treated as the sample name, and the second column defines the barcode. Subsequent columns are annotations on the cells.
- Cell format and Sample. How cells are identified, see Cell format in importers for more details.
- Matrix (Optional). When a matrix is supplied, the sample name is taken from the matrix. If the previous options also provide sample names:
- If different from the matrix, the importer fails with an error. This can be useful when checking that the file being imported matches the supplied matrix.
- Rows in the file describing cells that are not in the matrix are skipped.
Note that the sample name can only be set using only one of the First column defines sample, Cell format, or Sample options. The Matrix option is mandatory if none of the previous options is used for defining the sample and then the matrix must be for one sample only.
- Map clusters to QIAGEN Cell Ontology. When this is enabled, clusters will be translated, if possible, to the QIAGEN Cell Ontology (see The QIAGEN Cell Ontology). The translation attempts to match each cluster with a cell type from the ontology based on the name and known synonyms. For example, `alveolar epithelial cells' are also called `pneumocytes'. If the data file contains an `alveolar epithelial cells' cluster and this option is selected, then in the imported file the cluster will be named `pneumocytes'. This option can be useful when standardizing cell annotations from different sources. It is especially recommended if the imported data will be used to extend a QIAGEN Cell Type Classifier using the Train Cell Type Classifier tool (Train Cell Type Classifier). See figures 2.5 and 2.6 for an example.
Figure 2.5: An example of a file that could be imported with Import Cell Clusters. The file contains three different clusterings.
Figure 2.6: The result of importing the file shown in figure 2.5 using the option `Map clusters to QIAGEN Cell Ontology'. Note that all cell types have been translated to terms in the QIAGEN Cell Ontology. For example, `T cells' have been standardized to `T lymphocytes', and `alveolar epithelial cells' have been standardized to `pneumocytes'.