Immune Repertoire and Expression Analysis from Clonotypes and Matrix

The workflow Immune Repertoire and Expression Analysis from Clonotypes and Matrix takes one or more Expression Matrix () / () and TCR Cell Clonotypes (

) or BCR Cell Clonotypes (

) elements as input to jointly analyze scRNA-Seq and scV(D)J-Seq data originating from the same sample or samples. The Expression Matrices and Cell Clonotypes are sent on two different paths, for scRNA-Seq and scV(D)J-Seq data, respectively. The scRNA-Seq path follows the same analysis described in Expression Analysis from Matrix, while the scV(D)J-Seq path ensures that clonotypes are filtered accordingly.

The workflow uses the iterate functionality and allows for a combined analysis of multiple samples to produce:

a single, multi-sample, normalized Expression Matrix () / ();
a single, multi-sample, filtered TCR Cell Clonotypes () or BCR Cell Clonotypes () element;
a Dimensionality Reduction Plot () associated with the automated clusters, predicted cell types, identified clonotypes and additional cell annotations;
a Heat Map (), a Dot Plot (), and a Violin Plot () with the predicted cell types as cell groups;
a Cell Abundance Heat Map () with the automated clusters and predicted cell types as cell groups.
If velocity analysis is run:
- a Phase Portrait Plot () with per gene information on the velocity dynamics;
- a Velocity Genes Scores () element allowing identification of velocity genes driving the dynamics.

The workflow can be found here:

Template Workflows | Single Cell Workflows () | From Imported Data () | Immune Repertoire and Expression Analysis from Clonotypes and Matrix ()

If you are connected to a CLC Server via the CLC Workbench, you will be asked where you would like to run the analysis. We recommend that you run the analysis on a CLC Server when possible.

Using a Fork element, the workflow offers the option to run velocity analysis. To enable this, set Velocity Analysis to Run in the Specify Workflow Path wizard step. See https://resources.qiagenbioinformatics.com/manuals/clcgenomics/current/index.php?manual=Fork.html for details.

Choose either one or more Expression Matrix () / () and TCR Cell Clonotypes () or BCR Cell Clonotypes () elements or Select files for on-the-fly import and select the format that is compatible with the selected inputs. Read more about import options in On-the-fly import in workflows.

Note that the sample in the inputs must be the same for cells originating from the same sample. This can be achieved in different ways, depending on how the elements were generated:

If the input elements were generated in the CLC Single Cell Analysis Module, the sample name can be set when running Annotate Single Cell Reads.
If the input elements are imported, the sample name can be set during import through the Cell format or Sample options, see Cell format in importers.
The tool Update Single Cell Sample Name () can be used for updating the sample name in either input element, see Update Single Cell Sample Name.

For the scRNA-Seq path, a number of options are customizable, see Expression Analysis from Matrix. For the scV(D)J-Seq path, only clonotype filtering is customizable, as described in Immune Repertoire Analysis from Reads (10xV(D)J). Adjustments can be made in a workflow copy, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Creating_editing_workflows.html.

The workflow can be run using Single Cell hg38 (Ensembl) or Single Cell Mouse (Ensembl) reference data sets (see Reference data management).

Note: Reference data elements cannot be configured during workflow execution. If other elements than those provided in the default reference data sets are needed, a custom reference data set can be used, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Custom_Sets.html. When creating custom reference data sets, the chosen gene track needs to match the gene annotations used for training the provided Cell Type Classifier () (see Features used for training and prediction). Reference V, D, J and C gene segments for other species or for B cells can be imported using Import Immune Reference Segments (see Import Immune Reference Segments).

The workflow allows the analysis of multiple samples and you can specify metadata during workflow execution. This is converted to cell annotations and can be used for coloring the cells in the Dimensionality Reduction Plot. However, the workflow expects each sample to be present in just one Expression Matrix, and attempting to define batch units containing more than one Expression Matrix will lead to a failure during execution. Similarly, each sample is expected to be present in just one Cell Clonotypes element. For more details on configuring workflow execution with metadata, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Running_workflows_in_batch_mode.html. Make sure to inspect the batch overview to check that the analysis will be performed correctly.

Subsections

Output from Immune Repertoire and Expression Analysis from Clonotypes and Matrix

Browse the manual

Immune Repertoire and Expression Analysis from Clonotypes and Matrix