QIAGEN Bioinformatics Manuals

The expression browser

An Expression Browser is shown in figure 34.4.

Image expressionbrowser_table_simple
Figure 34.4: Expression browser table when no statistical comparison or annotations resources were provided.

Each row represents a gene or a transcript, defined by its name, the chromosome and the region where it is located, as well as an identifier linking to the relevant online database.

The expression values for each sample - or aggregation of samples - can be given by total counts, RPKM, TPM or CPM (TMM-adjusted). These measurements differ from each other in three key ways:

RPKM and TPM measure the number of transcripts whereas total counts and CPM measure the number of reads. The distinction is important because in an RNA-Seq experiment, more reads are typically sequenced from longer transcripts than from shorter ones.
RPKM, TPM and CPM are normalized for sequencing-depth so their values are comparable between samples. Total counts are not normalized, so values are not comparable between samples.
CPM (TMM-adjusted) is obtained by performing TMM normalization, followed by CPM without using a prior. TMM-adjustment depends on all samples included in the browser. In contrast, RPKM and TPM values are not TMM-adjusted, and thus are independent of other samples in the browser.

How do I get the normalized counts used to calculate fold changes? The CPM expression values are most comparable to the results of the Differential Expression for RNA-Seq tool. However, normalized counts are not used to calculate fold changes; instead the Differential Expression for RNA-Seq tool works by fitting a statistical model (which accounts for differences in sequencing-depth) to raw counts. It is therefore not possible to derive these fold changes from the CPM values by simple algebraic calculations.

It is possible to display the values for individual samples, or for groups of samples as defined by the metadata. Using the drop down menus in the "Grouping" palette of the Side Panel, you can choose to group samples according to up to three metadata layers as shown in figure 34.4.

When individual samples are aggregated, an additional "summary statistic" column can be displayed to give either the mean, the minimum, or the maximum expression value for each group of samples. The table in figure 34.4 shows the mean of the expression values for the first group layer that was selected.

If one or more statistical comparisons are provided, extra columns can be displayed in the table using the "Statistical comparison" section of the Settings panel (figure 34.5). The columns correspond to the different statistical values.

Image expressionbrowser_table_statcomp
Figure 34.5: Expression browser table when a statistical comparison is present.

If an annotation database is provided, extra columns can be displayed in the table using the "Annotation" section of the Settings panel (figure 34.6). Which columns are available depends on the annotation file used. When using a GO annotation file, the GO biological process column will list for each gene or transcript one or several biological processes. Click on the process name to open the corresponding page on the Consortium for Gene Ontology webpage. It is also possible to access additional online information by clicking on the PMID, RefSeq, HGNC or UniProt accession number when available.

Image expressionbrowser_table_go
Figure 34.6: Expression browser table when a GO annotation file is present.

Select the genes of interest and use the button present at the bottom of the table to highlight the genes in other views (volcano plot for instance) or to copy the genes of interest to a clipboard.

Browse the manual

The expression browser