Result of hierarchical clustering of samples

The result of a sample clustering is shown in figure 34.33.

Image sample_clustering_result_web
Figure 34.33: Sample clustering.

If you have used an experiment (Image experiment) and ran the non-workflow version of the tool, the clustering is added to the experiment and will be saved when you save the experiment. It can be viewed by clicking the Show Heat Map (Image heatmap) button at the bottom of the view (see figure 34.34).

Image experiment_editors
Figure 34.34: Showing the hierarchical clustering of an experiment.

If you have run the workflow version of the tool, or selected a number of samples ( (Image array) or (Image rnaseq)) as input, a new element will be created that has to be saved separately.

Regardless of the input, the view of the clustering is the same. As you can see in figure 34.33, there is a tree at the bottom of the view to visualize the clustering. The names of the samples are listed at the top. The features are represented as horizontal lines, colored according to the expression level. If you place the mouse on one of the lines, you will see the names of the feature to the left. The features are sorted by their expression level in the first sample (in order to cluster the features, see Hierarchical clustering of features).

Researchers often have a priori knowledge of which samples in a study should be similar (e.g. samples from the same experimental condition) and which should be different (samples from biological distinct conditions). Thus, researches have expectations about how they should cluster. Samples that are placed unexpectedly in the hierarchical clustering tree may be samples that have been wrongly allocated to a group, samples of unintended or unclean tissue composition or samples for which the processing has gone wrong. Unexpectedly placed samples, of course, could also be highly interesting samples.

There are a number of options to change the appearance of the heat map. At the top of the Side Panel, you find the Heat map preference group (see figure 34.35).

Image heatmap_side_panel
Figure 34.35: Side Panel of heat map.

At the top, there is information about the heat map currently displayed. The information regards type of clustering, expression value used together with distance and linkage information. If you have performed more than one clustering, you can choose between the resulting heat maps in a drop-down box (see figure 34.36).

Image heatmap-dropdown
Figure 34.36: When more than one clustering has been performed, there will be a list of heat maps to choose from.

Note that if you perform an identical clustering, the existing heat map will simply be replaced. Below this box, there is a number of settings for displaying the heat map.

Below you find the Samples and Features groups. They contain options to show names, legend, and tree above or below the heat map. Note that for clustering of samples, you find the tree options in the Samples group, and for clustering of features, you find the tree options in the Features group. With the tree options, you can also control the Tree size, from tiny to very large, and the option of showing the full tree, no matter how much space it will use.

For clustering of features, the Features group has an option to "Optimize tree layout". This attempts to reorder the features, consistently with the tree, such that the most expressed features form a diagonal from the top-left to the bottom-right of the heat map.

The Samples group contains an "Order by:" dropdown that allows re-ordering of the columns of the heat map. When clustering by samples it is possible to choose between using the "Tree" to determine the sample ordering, and showing the "Samples" in the order they were input to the tool. When clustering by features, only the "Samples" input order is available.

Note that if you wish to use the same settings next time you open a heat map, you need to save the settings of the Side Panel.