Normalization and clustering

The expression values are filtered and normalized as follows:

See RNA-Seq normalization for more details.

The samples and features, as relevant, are hierarchically clustered based on the similarity of their expression profiles, as follows:

  1. Create clusters containing one sample/feature.
  2. Calculate the distances between all clusters.
  3. Merge the two closest clusters into one.
  4. Repeat until only one cluster remains, containing all the samples/features.

The hierarchical cluster forms tree structures displayed along the rows and columns of the heat map. The tree branch lengths represent the distances between clusters.

The distance between two clusters is determined using one of the following linkage types:

The distance between two samples/features is calculated using one of the following distance measures: