Immune Repertoire and Expression Analysis from Clonotypes and Matrix

The workflow Immune Repertoire and Expression Analysis from Clonotypes and Matrix takes one or more Expression Matrices (Image expression_matrix_track_16_n_p) and Cell Clonotypes (Image cell_clonotypes_16_n_p) as input to jointly analyze scRNA-seq and scTCR-seq data originating from the same sample. The two types of inputs are sent on two different paths, for scRNA-seq and scTCR-seq data, respectively.

Note that the sample in the input elements (Expression Matrix and Cell Clonotypes) must be the same for the cells originating from the same sample. This can be achieved in different ways, depending on how the elements were generated:

The scRNA-seq path follows the same analysis described in Expression Analysis from Matrix, while scTCR-seq path ensures that clonotypes are filtered accordingly.

The workflow uses iterate functionality and allows for a combined analysis of multiple samples to produce:

The workflow can be found in the toolbox here:

        Workflows (Image sc_workflow_folder_open_16_n_p) | From Imported Data (Image sc_wf_from_imported_folder_open_16_n_p) | Immune Repertoire and Expression Analysis from Clonotypes and Matrix (Image sc_tcr_rna_from_matrix_clonotypes_16_n_p)

If you are connected to a CLC Server via your Workbench, you will be asked where you would like to run the analysis. We recommend that you run the analysis on a CLC Server when possible.

Choose either one or more Expression Matrix (Image expression_matrix_track_16_n_p) and Cell Clonotypes (Image cell_clonotypes_16_n_p) or Select files for import and select the formats that are compatible with the selected inputs. Read more about the import options in Data import.

For the scRNA-seq path, a number of options are customizable, see Expression Analysis from Matrix. For the scTCR-seq path, only clonotype filtering is customizable, as described in Immune Repertoire Analysis from Reads (10xVDJ). Adjustments can be made in a workflow copy, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Editing_existing_workflows.html.

The workflow can be run using Single Cell hg38 (Ensembl) or Single Cell Mouse (Ensembl) reference data sets (see The Reference Data Manager).

Note: Reference data elements cannot be configured during workflow execution. If other elements than those provided in the default reference data sets are needed a custom reference data set can be used, see  https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Custom_Sets.html. When creating custom reference data sets, the chosen gene track needs to match the gene annotations used for training the provided Cell Type Classifier (Image cell_type_classifier_16_n_p), see Features used for training and prediction.

The workflow allows the analysis of multiple samples and you can specify metadata during workflow execution. This is converted to cell annotations and can be used for coloring the cells in the Dimensionality Reduction Plot. However, the workflow expects each sample to be present in just one Expression Matrix, and attempting to define batch units containing more than one Expression Matrix will lead to a failure during execution. Similarly, each sample is expected to be present in just one Cell Clonotypes element. For more details on configuring workflow execution with metadata, see  https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Batching_part_workflow.html. Make sure to inspect the batch overview to check that the analysis will be performed correctly.



Subsections