Minimum Spanning Trees

A minimum spanning tree is a tree connecting all nodes in a graph, in a way such that the sum of edge lengths is minimized, see figure 13.5.

Image lmlst_spanning_tree
Figure 13.5: Minimum Spanning Tree construction. (Public domain image from Wikimedia: Minimum spanning tree.svg.)

Image lmlst_mst_distances
Figure 13.6: Minimum Spanning Tree distances.

Minimum spanning trees can be a bit counter-intuitive. Consider figure 13.6: The distance between A and B is not necessarily larger than the distance from A to C. But we do know that the distance between two nodes is greater or equal to the largest edge connecting them. (e.g. the distance between A and B is at least two, and the distance between A and C is at least two)

Minimum spanning trees are often used to visualize relationships between strains or isolates. But note that MST's are not unique - there are often many possible trees, especially for the classic 7-gene schemes, where there are only a very limited number of possible edge lengths. In order to break the ties when constructing the tree, our MST implementation favor creating connections to nodes that have many low-distance relations in the allelic distance matrix.

Minimum Spanning Trees can be created using the Create Large MLST Scheme, the Download Large MLST Scheme, the Import Large MLST Scheme tools or the Create Large MLST Sub Scheme button of an existing scheme.



Subsections