RNA-Seq Analysis for Long Reads
The RNA-Seq Analysis for Long Reads tool supports analysis of RNA-Seq data by mapping sequencing reads to an annotated reference genome with minimap2 [Li, 2018] and distributing and counting the reads across genes and transcripts. Subsequently, the results can be used for expression analysis.
RNA-Seq analysis with long reads is done in several steps: First, all annotated transcripts or genes are extracted. If there are several annotated splice variants, they are all extracted. Next, the reads are mapped against all the transcripts, and to the whole genome using minimap2. For more information about the read mapper, see Map Long Reads to Reference.
From this mapping, the reads are categorized and assigned to the transcripts using the EM estimation algorithm, and expression values for each gene are obtained by summing the transcript counts belonging to the gene.
For detailed information on RNA-Seq analysis including the EM algoritm, see The EM estimation algorithm.
Subsections
- Reads and reference settings
- Mapping settings
- Expression settings
- RNA-Seq Analysis for Long Reads output