Functional profile abundance table
The functional profile abundance table displays the names of the function, along with their clan, a combined abundance. The table can be visualized using the Stacked bar charts and Stacked area charts function, as well as the Sunburst charts.
- Table view () (figure 13.14)
Figure 13.14: Functional profile abundance table.The table displays the following columns:
- ID: internal ID the abundance tables use for ordering the samples. IDs are unique, while Names are not necessarily, so that when merging abundance tables taxa with the same ID will be combined.
- Name: the name of the taxon, specified by the reference database or the NCBI taxonomy. If the name contains the text "(Unknown)", it indicates that this taxon corresponds to a higher-level node in the taxonomy, and that this node had a significant amount of reads associated to ancestor taxons that are present in the database but were disqualified. This indicates that there was some organism in the sample for which there is no exactly matching reference in the database, but is most likely closely related to this taxon. If the name does not contain the text "(Unknown)", it means that the sample contains this exact taxon, which is present in the database.
- Clan: a collection of related Pfam entries. The relationship may be defined by similarity of sequence, structure or profile-HMM.
- Combined Abundance: total number of reads for the function across all samples
- Min, Max, Mean, Median and Std: respectively minimum, maximum, mean, median and standard deviation of the number of reads for the fucntion across all samples
- Abundance for the sample: number of reads for each sample
In the right side panel, under the tab Data, you can switch between raw and relative abundances (relative abundances are computed as the ratio between the coverage for a function in a specific sample and the amount of coverage in the sample). You can also combine absolute counts and relative abundances by selecting the Clan level in the Aggregate feature drop-down menu.
Finally, if you have previously annotated your table with Metadata (see section 7.8), you can Aggregate sample by the groups previously defined in your metadata table. This is useful when for example analyzing replicates from the same sample origin.
Under the table, the following actions are available:
- Create Abundance Subtable will create a table containing only the selected rows.
- Create Normalized Abundance Subtable will create a table with all rows normalized on the values of a single selected row. The row used for normalization will disappear from the new abundance table. The normalization scales the abundance tables linearly, where the scaling factor is calculated by determining the average abundance across all samples and for each sample scale it to the average for the reference. Note that to be enabled, the selected row for normalization can only have non null abundance values. If you have zero values in some samples for the control, you will need to generate a new abundance table where these samples are not present. If the abundance table is obtained from merging single-sample abundance table, then the merge should be redone excluding the samples with zero control read counts.
- Stacked Bar Chart and Stacked Area Chart ()
Choose which chart you want to see using the drop down menu in the upper right corner of the side panel. In the Stacked Bar (figure 13.15) and Stacked Area Charts (figure 13.16), the metadata can be used to aggregate groups of columns (samples) by selecting the relevant metadata category in the right hand side panel. Also, the data can be aggregated at any taxonomy level selected. The relevant data points will automatically be summed accordingly.
Figure 13.15: Stacked bar chart.
Figure 13.16: Stacked area chart.Holding the pointer over a colored area in any of the plots will result in the display of the corresponding taxonomy label and counts. Filter level allows to modify the number of features to be shown in the plot. For example, setting the value to 10 means that the 10 most abundant features of each sample will be shown in all columns. The remaining features are grouped into "Other", and will be shown if the option is selected in the right hand side panel. One can select which taxonomy level to color, and change the default colors manually. Colors can be specified at the same taxonomy level as the one used to aggregate the data or at a lower level. When lower taxonomy levels are chosen in the data aggregation field, the color will be inherited in alternating shadings. It is also possible to sort samples by metadata attributes, and to show groups of samples without collapsing their stacks, as well as change the label of each stack or group of stacks. Features can be sorted by "abundance" or "name" using the drop down menu in the right hand side panel. Using the bottom right-most button (Save/restore settings ()), the settings can be saved and applied in other plots, allowing visual comparisons across analyses.
- Zoomable Sunbursts ()
The Zoomable Sunburst viewer lets the user select how many taxonomy level
counts to display, and which level to color. Lower levels will inherit the
color in alternating shadings. Taxonomy and relative abundances (the ratio
between the coverage of the species in a specific sample and the
total amount of coverage in the sample) are displayed in a legend to the left
of the plot when hovering over the sunburst viewer with the mouse. The metadata
can be used to select which sample or group of samples to show in the sunburst
(figure 13.17).
Clicking on a lower level field will render that field the center of the plot and display lower level counts in a radial view. Clicking on the center field will render the level above the current view the center of the view.