Identify and Annotate Differentially Expressed Genes and Pathways

The Identify and Annotate Differentially Expressed Genes and Pathways compares genes expression in different groups of samples and performs a gene ontology (GO) enrichment analysis on the differentially expressed genes to identify affected pathways. The workflow takes as input Gene Expression (GE) or Transcript Expression (TE) tracks that were generated using the RNA-Seq analysis tool. The samples must be associated to a metadata table.

To run the ready-to-use workflow:

        Toolbox | Ready-to-Use Workflows | Whole Transcriptome Sequencing (Image rna_seq_group_closed_16_n_p) | Human, Mouse or Rat | Identify and Annotate Differentially Expressed Genes and Pathways (Image identify_differentially_expressed_genes_wts_16_n_p)

  1. Double-click on the Identify and Annotate Differentially Expressed Genes and Pathways ready-to-use workflow to start the analysis. If you are connected to a server, you will first be asked where you would like to run the analysis.

  2. Next, you will be asked to select the samples to analyze (figure 17.28). You can select several GE tracks or TE tracks generated by the RNA-Seq analysis tool, but not a combination of both. Click Next.

    Image rnaseq_identify_differentially_expressed_genes_step2
    Figure 17.28: Select the GE or TE tracks to analyze.

  3. In the next wizard step you can set up the experimental design associated with the data (figure 17.29):
    • Choose the metadata table that was associated to the GE or TE tracks used in the previous step.
    • Choose the factor (must be one of the metadata category) that should be used to test for differential expression.
    • It is possible to specify confounding factors, i.e., factors that are not of primary interest, but may affect gene expression.
    • The Comparisons panel determines the number and type of statistical comparison tracks output by the workflow (see Output of the Differential Expression for RNA-Seq tool for more details).

    Image rnaseq_identify_differentially_expressed_genes_step3
    Figure 17.29: Specify the experimental design desired for running the workflow.

  4. In the next step you can choose to preview the settings and save the results (see figure 17.30).

    Image rnaseq_identify_differentially_expressed_genes_step5
    Figure 17.30: The results handling step.

    Click Finish to start the analysis.

The following outputs are generated:

  1. PCA for RNA-Seq plot (Image pca_plot_16_n_p) Projects a high-dimensional dataset (where the number of dimensions equals the number of genes or transcripts) onto two or three dimensions.
  2. Statistical Comparison (Image stats_track_16_n_p) The information can be accessed in two different ways:
    • Open as a track, hold shift and hover over a feature. A tooltip will appear with information about gene name, results of statistical tests, and expression values.
    • Open the track in table format by clicking on the table icon in the lower left side of the View Area.
  3. Genome Browser View Differentially Expressed Genes and Pathways (Image trackset_16_n_p) A collection of tracks presented together. Shows the human reference sequence, annotation tracks for genes, coding regions CDS, mRNA, and statistical comparison tracks (see figure 17.31).
  4. Heat Map for RNA-Seq (Image heatmap_16_n_p) A two dimensional heat map of expression values. Each column corresponds to one sample, and each row corresponds to a feature (a gene or a transcript). The samples and features are both hierarchically clustered.
  5. Venn Diagram (Image venn_16_n_p) To compare the overlap of differentially expressed genes or transcripts in two or more statistical comparison tracks.
  6. Expression Browser (Image expression_experiment_16_h_p) To inspect gene and transcript expression level counts and statistics for many samples at the same time.
  7. GO Enrichment Analysis (Image table) A table showing the results of the GO enrichment analysis. The table includes GO terms, a description of the affected function/pathway, the number of genes in each function/pathway, the number of affected genes within the function/pathway, and p-values.

Image rnaseq_identify_differentially_expressed_genes_genomebrowserview
Figure 17.31: The genome browser view allows comparison of the expression comparison tracks with the reference sequence and different annotation tracks.

Please refer to the relevant sections of the the RNA-Seq Analysis tools chapter for additional information on the different output mentioned above.