K-mer Based Tree Construction

K-mer Based Tree Construction is a distance-based method that builds trees from unaligned sequences. K-mers are used to compute distance matrices, which are then passed to distance-based phylogenetic reconstruction methods such as Neighbor Joining and UPGMA. This method is less precise than Create Tree, but it is also less resource-intensive and does not require an alignment to be created first. This makes it suitable for creating trees from large numbers of long sequences. It is especially useful for whole genome phylogenetic reconstruction of closely related genomes, e.g., genomes with small differences relative to one another and few or no structural variations.
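To illustrate how k-mers can yield a distance matrix without an alignment, here is a minimal Python sketch. It is not the tool's implementation: the function names, the toy sequences, and the choice of Euclidean distance between relative k-mer frequencies are all assumptions made for illustration only.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def kmer_profile(seq, k):
    """Return the relative frequency of each overlapping k-mer in seq."""
    seq = seq.upper()
    if len(seq) < k:
        raise ValueError("sequence is shorter than k")
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()}


def euclidean(p, q):
    """Euclidean distance between two sparse k-mer frequency profiles."""
    keys = set(p) | set(q)
    return sqrt(sum((p.get(x, 0.0) - q.get(x, 0.0)) ** 2 for x in keys))


def distance_matrix(seqs, k):
    """Build a symmetric distance matrix as a dict of dicts keyed by name."""
    names = list(seqs)
    profiles = {name: kmer_profile(seqs[name], k) for name in names}
    dist = {a: {b: 0.0 for b in names} for a in names}
    for a, b in combinations(names, 2):
        dist[a][b] = dist[b][a] = euclidean(profiles[a], profiles[b])
    return dist


# Hypothetical toy sequences; real inputs would be far longer.
sequences = {
    "seqA": "ACGTACGTACGTACGA",
    "seqB": "ACGTACGTACGTACGT",
    "seqC": "TTGGCCAATTGGCCAA",
}
matrix = distance_matrix(sequences, k=3)
```

In this toy input, seqA and seqB differ by a single base and share most of their 3-mers, so their distance is much smaller than the distance from either to seqC. No alignment is computed at any point, which is why the approach scales to many long sequences.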

To launch K-mer Based Tree Construction, go to:

        Tools | Classical Sequence Analysis | Alignments and Trees | K-mer Based Tree Construction

This tool accepts individual sequences and sequence lists as input.

In the Tree Construction launch wizard step, select the construction method to use, the k-mer length, and a distance measure (figure 25.1):

Figure 25.1: The Tree Construction launch wizard step.
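
As a rough illustration of what a construction method does with a distance matrix, the following Python sketch applies UPGMA to a small, made-up matrix. It is a simplified sketch under stated assumptions, not the tool's implementation: the function name `upgma`, the dict-of-dicts input format, and the toy distances are hypothetical, and branch lengths are omitted so that only the tree topology is returned.

```python
from itertools import combinations


def upgma(dist):
    """Cluster labels with UPGMA and return the topology as nested tuples.

    dist maps each label to a dict of distances to every other label.
    """
    names = list(dist)
    # Active clusters: id -> (subtree, number of leaves in the subtree).
    clusters = {i: (name, 1) for i, name in enumerate(names)}
    # Pairwise distances between cluster ids, keyed by unordered pair.
    d = {
        frozenset((i, j)): dist[names[i]][names[j]]
        for i, j in combinations(range(len(names)), 2)
    }
    next_id = len(names)
    while len(clusters) > 1:
        # Pick the closest pair of active clusters.
        i, j = min(
            combinations(sorted(clusters), 2),
            key=lambda pair: d[frozenset(pair)],
        )
        tree_i, n_i = clusters.pop(i)
        tree_j, n_j = clusters.pop(j)
        # Distance from the merged cluster to every remaining cluster is
        # the average of the two old distances, weighted by leaf counts.
        for m in clusters:
            d[frozenset((next_id, m))] = (
                n_i * d[frozenset((i, m))] + n_j * d[frozenset((j, m))]
            ) / (n_i + n_j)
        clusters[next_id] = ((tree_i, tree_j), n_i + n_j)
        next_id += 1
    tree, _ = next(iter(clusters.values()))
    return tree


# Made-up distances for three hypothetical sequences.
toy = {
    "A": {"A": 0.0, "B": 2.0, "C": 6.0},
    "B": {"A": 2.0, "B": 0.0, "C": 6.0},
    "C": {"A": 6.0, "B": 6.0, "C": 0.0},
}
tree = upgma(toy)
```

With these toy distances, A and B are merged first because they are the closest pair, and C joins the tree last. Neighbor Joining consumes the same kind of matrix but selects pairs with a different criterion, so the two methods can produce different trees from the same distances.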