Over-representation analysis
The 5mer analysis examines the enrichment of pentanucleotides (5mers). The enrichment of a 5mer is the ratio of its observed frequency to its expected frequency. The expected frequency is the product of the empirical probabilities of the nucleotides that make up the 5mer. (Example: for the 5mer CCCCC, if cytosine accounts for 20% of the bases in the examined sequences, the expected frequency is 0.2^5 = 0.00032.) Note that 5mers containing ambiguous bases (anything other than A, T, C, or G) are ignored.
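The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the tool's actual implementation: the function name `kmer_enrichment` is hypothetical, and it assumes that nucleotide probabilities are estimated from all unambiguous bases and that the observed frequency of a 5mer is its count divided by the number of unambiguous 5mers.

```python
from collections import Counter

BASES = "ACGT"


def kmer_enrichment(sequences, k=5):
    """Return {kmer: observed frequency / expected frequency}.

    Hypothetical helper; assumes base probabilities come from all
    unambiguous bases and k-mers with ambiguous bases are skipped.
    """
    base_counts = Counter()
    kmer_counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Count only unambiguous bases for the empirical probabilities.
        base_counts.update(b for b in seq if b in BASES)
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Ignore k-mers that contain anything other than A/C/G/T.
            if all(b in BASES for b in kmer):
                kmer_counts[kmer] += 1

    total_kmers = sum(kmer_counts.values())
    if total_kmers == 0:
        return {}
    total_bases = sum(base_counts.values())
    base_prob = {b: base_counts[b] / total_bases for b in BASES}

    enrichment = {}
    for kmer, count in kmer_counts.items():
        observed = count / total_kmers
        # Expected frequency: product of the per-base probabilities.
        expected = 1.0
        for b in kmer:
            expected *= base_prob[b]
        enrichment[kmer] = observed / expected
    return enrichment
```

A value above 1 means the 5mer occurs more often than its base composition alone would predict; a value below 1 means it occurs less often.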
- Individual 5mer distribution
- Calculates the absolute coverage at each base position for each 5mer independently and plots the five most enriched 5mers.
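The per-position counting behind this plot could look roughly like the sketch below. It is an assumption-laden illustration: the name `positional_kmer_counts` is hypothetical, "coverage" is interpreted here as the number of reads in which a 5mer starts at a given position (0-based), and the plotting step is left out.

```python
from collections import Counter, defaultdict

VALID = set("ACGT")


def positional_kmer_counts(sequences, k=5):
    """Return {kmer: Counter({start_position: count})}.

    Hypothetical helper; assumes coverage is counted at the
    0-based start position of each k-mer within a read.
    """
    counts = defaultdict(Counter)
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Skip k-mers with ambiguous bases, as in the enrichment step.
            if set(kmer) <= VALID:
                counts[kmer][i] += 1
    return counts
```

The counters for the five 5mers with the highest enrichment could then be drawn as one line each, with read position on the x-axis and absolute count on the y-axis.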