Visualization of OTU abundance tables
The OTU abundance tables containing the newly created OTUs or the chimeras give the abundance of each OTU or chimera at each site, as well as the total abundance across all samples. There are a number of ways of visualizing the contents of an OTU abundance table:
- Table view (figure 5.5)
Figure 5.5: OTU abundance table.
The table displays the following columns:
- Name The name of the OTU, specified by either the reference database or by the OTU representative (see below for more details).
- Taxonomy The taxonomy of the OTU, as specified by the reference database when a database entry was used as Reference.
- Combined Abundance The total number of reads belonging to the OTU across all samples.
- Min Minimum abundance across all samples
- Max Maximum abundance across all samples
- Mean Mean abundance of all samples
- Median Median abundance of all samples
- Std Standard deviation of all samples
- Abundance for each sample The number of reads belonging to the OTU in a specific sample.
- Sequence The sequence of the centroid of the OTU.
Note on OTU Names: The name is either
- the OTU name in the reference database (e.g. 978664)
- the name of the read used as centroid, which for sequencing data may look like a random string of numbers and letters. If the same name is present more than once, the OTU names are given a trailing number (e.g. "-00123"), as in readName-12345.
- or, if there is no name (for new clusters whose reads have no name), an assigned name such as OTU-12345.
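The summary columns listed above (Combined Abundance, Min, Max, Mean, Median, Std) can be reproduced from the per-sample read counts of a single OTU. The following is a minimal sketch, not the tool's own implementation: the function name and the sample names are hypothetical, and it assumes the sample standard deviation (n - 1 denominator), which the text does not specify.

```python
from statistics import mean, median, stdev


def summarize_otu(sample_counts):
    """Return the summary columns for one OTU from its per-sample read counts."""
    values = list(sample_counts.values())
    return {
        "combined_abundance": sum(values),
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
        "median": median(values),
        # Assumption: sample standard deviation (n - 1 denominator).
        "std": stdev(values) if len(values) > 1 else 0.0,
    }


# Hypothetical OTU observed at three sites.
summary = summarize_otu({"site_A": 10, "site_B": 0, "site_C": 20})
```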
In the right-hand side panel, under the Data tab, you can switch between absolute counts and relative abundances. The relative abundance is the ratio between the number of reads belonging to the OTU in a specific sample and the total number of reads in that sample. You can also combine absolute counts and relative abundances by taxonomic level by selecting the appropriate level in the Aggregate feature drop-down menu. Use the option below it to Hide samples for which the taxonomy at the aggregated taxonomic level is incomplete. Finally, if you have previously annotated your table with Metadata (see section 9.7), you can Aggregate sample by the groups defined in your metadata table. This is useful when analyzing replicates from the same sample origin.
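To make the two Data tab operations concrete, the sketch below computes relative abundances and sums counts by taxonomic level for a small in-memory table. It is illustrative only: the table layout (`{otu: {sample: reads}}`), the function names, and the choice to pad incomplete lineages with "Unknown" are assumptions, not the behavior of the tool.

```python
from collections import defaultdict


def relative_abundances(table):
    """Divide each count by the total number of reads in its sample."""
    totals = defaultdict(int)
    for counts in table.values():
        for sample, reads in counts.items():
            totals[sample] += reads
    return {
        otu: {s: (r / totals[s] if totals[s] else 0.0) for s, r in counts.items()}
        for otu, counts in table.items()
    }


def aggregate_by_level(table, taxonomy, depth):
    """Sum the counts of all OTUs that share the first `depth` taxonomy levels."""
    merged = defaultdict(lambda: defaultdict(int))
    for otu, counts in table.items():
        lineage = list(taxonomy[otu])
        # Assumption: pad incomplete lineages so they still form a group.
        lineage += ["Unknown"] * (depth - len(lineage))
        key = "; ".join(lineage[:depth])
        for sample, reads in counts.items():
            merged[key][sample] += reads
    return {key: dict(counts) for key, counts in merged.items()}


# Hypothetical table with three OTUs and two samples.
table = {
    "otu1": {"A": 30, "B": 10},
    "otu2": {"A": 10, "B": 10},
    "otu3": {"A": 60, "B": 0},
}
taxonomy = {
    "otu1": ["Bacteria", "Firmicutes"],
    "otu2": ["Bacteria", "Firmicutes"],
    "otu3": ["Bacteria", "Bacteroidetes"],
}
rel = relative_abundances(table)
by_phylum = aggregate_by_level(table, taxonomy, depth=2)
```

Aggregating the absolute counts first and then converting to relative abundances gives the same result as summing the relative abundances of the grouped OTUs, since both are divided by the same per-sample total.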
Under the table, the following actions are available:
- Create Abundance Subtable will create a table containing only the selected rows.
- Create Sequence Sublist will create a sequence list containing only the selected rows.
- Create Normalized Abundance Subtable will create a table with all rows normalized on the values of a single selected row. Note that this action is only enabled when the row selected for normalization has non-zero abundance values in every sample. If the control has zero values in some samples, you will need to generate a new abundance table that does not contain these samples. If the abundance table was obtained by merging single-sample abundance tables, then the merge should be redone excluding the samples with zero control read counts.
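The non-zero requirement on the control row follows directly from the arithmetic. The sketch below assumes that "normalized on the values of a single selected row" means dividing each count by the control count in the same sample; that interpretation, the function name, and the example data are assumptions rather than a description of the tool's internals.

```python
def normalize_on_row(table, control):
    """Divide every row by the control row, sample by sample."""
    control_counts = table[control]
    zero_samples = sorted(s for s, r in control_counts.items() if r == 0)
    if zero_samples:
        # A zero control count would mean dividing by zero for that sample.
        raise ValueError(
            "control row has zero abundance in: " + ", ".join(zero_samples)
        )
    return {
        otu: {s: r / control_counts[s] for s, r in counts.items()}
        for otu, counts in table.items()
    }


# Hypothetical table where "spike" is the control row.
counts = {"spike": {"A": 10, "B": 5}, "otu1": {"A": 20, "B": 5}}
normalized = normalize_on_row(counts, "spike")
```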
- Stacked Bar Chart and Stacked Area Chart
In the Stacked Bar (figure 5.6) and Stacked Area Charts (figure 5.7), the metadata can be used to aggregate groups of columns (samples) by selecting the relevant metadata category in the right-hand side panel. The data can also be aggregated at any selected taxonomy level. The relevant data points will automatically be summed accordingly.
Figure 5.6: Stacked bar of the microbial community at the order level for 3 different sites.
Figure 5.7: Stacked area of the microbial community at the phylum level for 12 different sites.
Holding the pointer over a colored area in any of the plots displays the corresponding taxonomy label and counts. Filter level sets the number of features shown in the plot. For example, setting the value to 10 means that the 10 most abundant features of each sample will be shown in all columns. The remaining features are grouped into "Other", and will be shown if the option is selected in the right-hand side panel. You can select which taxonomy level to color, and change the default colors manually. Colors can be specified at the same taxonomy level as the one used to aggregate the data, or at a lower level. When lower taxonomy levels are chosen in the data aggregation field, the color will be inherited in alternating shadings. It is also possible to sort samples by metadata attributes, to show groups of samples without collapsing their stacks, and to change the label of each stack or group of stacks. Features can be sorted by "abundance" or "name" using the drop-down menu in the right-hand side panel. Using the bottom right-most button (Save/restore settings), the settings can be saved and applied in other plots, allowing visual comparisons across analyses.
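The Filter level behavior can be read as: take the top N features of every sample, show the union of those features in all columns, and sum everything else into "Other". The sketch below follows that reading. The function name, the table layout, and the alphabetical tie-breaking are assumptions made for illustration.

```python
def filter_top_features(table, n):
    """Keep the n most abundant features of each sample; group the rest as "Other"."""
    samples = sorted({s for counts in table.values() for s in counts})
    keep = set()
    for sample in samples:
        # Rank by descending abundance; ties are broken by name (assumption).
        ranked = sorted(table, key=lambda f: (-table[f].get(sample, 0), f))
        keep.update(ranked[:n])
    filtered = {f: dict(table[f]) for f in table if f in keep}
    dropped = [f for f in table if f not in keep]
    if dropped:
        filtered["Other"] = {
            s: sum(table[f].get(s, 0) for f in dropped) for s in samples
        }
    return filtered


# Hypothetical features: f1 dominates sample A, f2 dominates sample B.
features = {
    "f1": {"A": 50, "B": 1},
    "f2": {"A": 5, "B": 40},
    "f3": {"A": 3, "B": 2},
}
top1 = filter_top_features(features, n=1)
```

Because the kept set is a union across samples, a plot with a filter level of N can show more than N features in total.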
- Zoomable Sunbursts
The Zoomable Sunburst viewer lets the user select how many taxonomy level counts to display, and which level to color. Lower levels will inherit the color in alternating shadings. Taxonomy and relative abundances (the ratio between the number of reads belonging to the OTU in a specific sample and the total number of reads in the sample) are displayed in a legend to the left of the plot when hovering over the sunburst viewer with the mouse. The metadata can be used to select which sample or group of samples to show in the sunburst (figure 5.8).
Figure 5.8: Sunburst view of the microbial community showing all taxa belonging to the kingdom Bacteria.
Clicking on a lower-level field makes that field the center of the plot and displays its lower-level counts in a radial view. Clicking on the center field moves the view up one level, so that the level above the current center becomes the new center (figure 5.9).
Figure 5.9: Sunburst view of the microbial community zoomed to show all taxa belonging to the phylum Bacteroidetes.
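The data behind a sunburst of this kind can be thought of as read counts summed for every taxonomy prefix, with zooming restricting the view to one subtree. The following sketch illustrates that idea only; the function names, the tuple-based lineage representation, and the example counts are hypothetical and do not describe how the viewer is implemented.

```python
from collections import Counter


def ring_counts(lineage_counts, max_depth):
    """Sum reads for every taxonomy prefix, down to max_depth levels."""
    totals = Counter()
    for lineage, reads in lineage_counts.items():
        for depth in range(1, min(len(lineage), max_depth) + 1):
            totals[lineage[:depth]] += reads
    return totals


def zoom(lineage_counts, center):
    """Keep only the lineages that fall under the chosen center node."""
    return {
        lineage: reads
        for lineage, reads in lineage_counts.items()
        if lineage[: len(center)] == center
    }


# Hypothetical read counts for one sample, keyed by lineage.
lineages = {
    ("Bacteria", "Bacteroidetes", "Bacteroidia"): 60,
    ("Bacteria", "Firmicutes", "Clostridia"): 30,
    ("Bacteria", "Firmicutes", "Bacilli"): 10,
}
rings = ring_counts(lineages, max_depth=2)
firmicutes_only = zoom(lineages, ("Bacteria", "Firmicutes"))
```

Dividing any ring count by the count of the root node gives the relative abundance shown in the legend when hovering over that field.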