Using the Assembly ID attribute
Many tools in the QIAGEN CLC Microbial Genomics Module use the Assembly ID attribute on sequences to group them into meaningful entities, for example all contigs of a draft assembly. To see how a particular tool uses the Assembly ID attribute, read the manual section for that tool.
Tools that are aware of this attribute include:
- Bin Pangenomes by Sequence
- Create K-mer Tree
- Create MLST Scheme
- Create Taxonomic Profiling Index
- Find Best Matches using K-mer Spectra
- Find Prokaryotic Genes
Attributes can be added to sequences in a sequence list in the following way:
- Open the Table view of a sequence list.
- Select all rows corresponding to sequences that should be grouped.
- Right-click on the selection and choose Add Attributes....
- Select Assembly ID from the dropdown menu in the Name field.
- Enter a string in the Value field to uniquely identify the assembly.
For large sequence lists containing many assemblies, it may be more efficient to use Update Sequence Attributes instead.
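
When a list holds many assemblies, the mapping from sequence name to Assembly ID can be prepared outside the Workbench. The following is a minimal Python sketch of one way to do this. It assumes a hypothetical naming convention in which each sequence name starts with an assembly label followed by an underscore, and it writes a two-column table of sequence name and Assembly ID. The function names, the column headers and the CSV layout are illustrative assumptions; check the Update Sequence Attributes manual for the table format the tool actually accepts.

```python
import csv
import io


def assembly_id_from_name(name: str) -> str:
    """Derive an Assembly ID from a sequence name.

    Assumption (hypothetical convention): the text before the first
    underscore identifies the assembly, e.g. "strainA_contig1" -> "strainA".
    """
    return name.split("_", 1)[0]


def build_attribute_table(sequence_names):
    """Return CSV text with one row per sequence: its name and Assembly ID."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    # Column headers are illustrative, not a documented requirement.
    writer.writerow(["Name", "Assembly ID"])
    for name in sequence_names:
        writer.writerow([name, assembly_id_from_name(name)])
    return buffer.getvalue()


# Example sequence names (made up for illustration).
names = ["strainA_contig1", "strainA_contig2", "strainB_contig1"]
table = build_attribute_table(names)
print(table)
```

All contigs that share a label receive the same Assembly ID, which is what lets the tools listed above treat them as one assembly. If the sequence names follow a different pattern, only `assembly_id_from_name` needs to change.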
