Detect Amplicon Sequence Variants and Assign Taxonomies
The Detect Amplicon Sequence Variants and Assign Taxonomies workflow processes reads from amplicon sequencing to yield a merged multi-sample (if applicable) ASV abundance table, and subsequently assigns taxonomies to the ASVs (amplicon sequence variants).
We recommend making preliminary evaluations of the read lengths and qualities, to decide on parameter settings like read length. This can be done by running a single sample through the workflow, and taking a look at the resulting trim report section Read length before / after trimming.
Launching the workflow
The Detect Amplicon Sequence Variants and Assign Taxonomies workflow is at:
Toolbox | Template Workflows () | Microbial Workflows () | Metagenomics () | Amplicon-Based Analysis () | Detect Amplicon Sequence Variants and Assign Taxonomies ()
Launch the workflow and step through the wizard.
- Select the sequence list(s) containing the reads to process and click on Next.
- Select a Taxonomic Profiling Index and click on Next (figure 2.16).
Figure 2.16: Wizard step for selecting the Taxonomic Profiling Index. - The 'Configure batching' and 'Batch overview' steps can be left as is (figure 2.17), or configured as described in https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Launching_workflows_individually_in_batches.html.
Figure 2.17: Optional wizard step to configure metadata for the input sequences. - Select a Trim Adapter List if relevant for your application (figure 2.18). The Trim Adapter List should correspond to the adapters used for sequencing. If no input is provided, the tool will skip the adapter trimming step. Click on Next.
Figure 2.18: Wizard step for selecting the Trim Adapter List. - Choose the trim length to use for Detect Amplicon Sequence Variants and decide whether to remove chimeras by toggling the Remove chimeras box (figure 2.19). Click on Next.
The optimal read length setting will depend on the length of your reads after trimming. We recommend that you have a look at the trim report section Read length before / after trimming if you are unsure about what value to set.
Figure 2.19: Wizard step for selecting read trim length and whether to remove chimeras in the Detect Amplicon Sequence Variants tool. - Finally, select a location to save outputs to and click on Finish.
Workflow tools and outputs
The Detect Amplicon Sequence Variants and Assign Taxonomies workflow consists of the below mentioned tools. See figure 2.20 for a full overview of the workflow.
- QC for Sequencing Reads. Performs basic QC on the sequencing reads and outputs a report that can be used to evaluate the quality of the sequencing reads, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=QC_Sequencing_Reads.html. A graphical and a supplementary report is output for each input sample.
- Trim Reads. Removes adapter sequences (optional) and low quality nucleotides, see https://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Trim_Reads.html. The tool outputs a trim report for each sample.
- Detect Amplicon Sequence Variants. Trims and filters the reads, followed by dereplication and denoising. If chosen in the wizard, chimeras will be removed, and in case of paired-end reads, the unique read pairs are merged, see
Detect Amplicon Sequence Variants. The workflow outputs one ASV sequence list and ASV report per sample.
- Merge Abundance Tables. Merges the per-sample ASV abundance tables from the previous step and outputs a merge report. The merged ASV table is used as input for Assign Taxonomies to Sequences in Abundance Table.
- Assign Taxonomies to Sequences in Abundance Table. Assigns taxonomies to the sequences of the merged ASV abundance table according to the reference index provided, see Assign Taxonomies to Sequences in Abundance Tables. The tool outputs a report and a merged ASV abundance table with taxonomies.
To learn about the ASV abundance table, see Detect Amplicon Sequence Variants output. - Combined analysis report. Combines the report content from the Trim Reads and Detect Amplicon Sequence Variants tools for all samples in the workflow run.
- Combined QC report. Contains QC metrics for the raw reads for all samples in the workflow run.
Figure 2.20: Layout of the Detect Amplicon Sequence Variant and Assign Taxonomies workflow.