Gene Set Test

Gene Set Test takes a Statistical comparison track (Image expression_comparison_track_16_n_p) and a Gene Ontology Annotation (GOA) table (Image array_annotations) as input. The tool outputs a GO enrichment analysis table, summarizing the results of hypergeometric tests that evaluate whether terms from the GOA table are over-represented in the set of differentially expressed features from the input track.

To run the tool, go to:

        Tools | RNA-Seq and Small RNA Analysis (Image rna_seq_group_closed_16_n_p)| Differential Expression (Image rna_expression_folder_closed_16_n_p) | Gene Set Test (Image identify_differentially_expressed_genes_16_n_p)

The following options can be configured in the Annotation testing parameters dialog (figure 33.95):

Image genesettest_output1
Figure 33.95: Annotation testing parameters.

Image genesettest_output5
Figure 33.96: GO terms with the [IEA] tag are computationally inferred.

The following options for defining the differentially expressed features can be configured in the Filtering parameters dialog (figure 33.97):

Image genesettest_output2
Figure 33.97: Filtering parameters.

The tool outputs a GO enrichment analysis table (figure 33.98). Each row in the table corresponds to a GO term and includes information on the number and names of the detected and differentially expressed (DE) features, as well as the hypergeometric test p-value, FDR, and Bonferroni corrected p-values.

Image genesettest_output3
Figure 33.98: GO enrichment analysis table with rows sorted by FDR p-value.

GO terms are organized in a hierarchical structure. For example, the term "GO:0033151 V(D)J recombination" from the Gene Ontology [Ashburner et al., 2000,The Gene Ontology Consortium, 2019] (https://geneontology.org/) is a descendant of "GO:0006259 DNA metabolic process".

When testing for the significance of a particular GO term, all features linked to descendant GO terms are included in the test. This can lead to a higher number of detected genes in the output table, compared to the number of genes linked to the tested GO term.

Due to the hierarchical structure, GO terms are not independent of one another, and the p-values provided in the enrichment analysis table should be interpreted with caution.