The output of Normalize Single Cell Data

Normalize Single Cell Data produces the following outputs:

The report includes the following sections.

Summary

Contains a table with:

See The Normalize Single Cell Data algorithm for more details.

Variation of normalized expressions

Contains the following subsections:

Fitted values

The remaining sections of the report contain the fitted values for each model term: 'Intercept', 'Log10(theta)' (figure 7.28), and a term for each batch effect (figure 7.29).

These sections can be useful for assessing whether the model is likely over-parameterized, as it easy to see the number of terms. Each section contains a plot with the fitted values and, where relevant, the regularized trend.

The plots for the batch effect terms show the fold change for the corresponding batch relative to the baseline (figure 7.29). It can be useful to inspect the fold change of individuals genes. For this, open the plot outside the report by double-clicking on it, switch to the table view (Image table_16_n_p), and sort the table by fold change. For example, when correcting for cell cycle effects, genes involved in the cell cycle are expected to exhibit large positive and negative fold changes.

Image fittedcoefficientdispersion
Figure 7.28: Fitted dispersion $ \theta $. The red line is the regularized trend, providing a more robust estimate of the dispersion. The expression of some genes is consistent with a Poisson distribution, seen here by the band of genes at $ \theta =10^3$.

Image fittedcoefficientbatch
Figure 7.29: Batch effect term. The y-axis shows the natural logarithm of the fold change in the CL sample relative to the baseline SM2 sample. The large fold changes, concentrated in two bands at y=20 and y=-20, correspond to genes that are not expressed in at least one of the samples.