Basic analysis
The basic analysis section assesses the most simple characteristics that are supported by all sequencing technologies.The Summary table provides information regarding the creation date, the author, the software used, the number of data sets the report is based upon, as well as data name and content in terms of read number and total number of nucleotides.
- Sequence length distribution
- Counts the number of sequences that have been observed for individual sequence lengths. The resulting table correlates sequence-lengths in base-pairs with numbers of sequences observed with that number of base-pairs.
- Base coverage distribution
- Calculates absolute coverages for individual base positions. The resulting table correlates base-positions with the number of sequences that supported (covered) that position.
- Sequence-wise %GC-content distribution
- Counts the number of sequences that feature individual %GC-contents in 101 bins ranging from 0 to 100%.The %GC-content of a sequence is calculated by dividing the absolute number of G/C-nulceotides by the length of that sequence.
- Sequence-wise %N-content distribution
- Counts the number of sequences that feature individual %N-contents in 101 bins ranging from 0 to 100%, where N refers to all ambiguous base-codes as specified by IUPAC.The %N-content of a sequence is calculated by dividing the absolute number of ambiguous nucleotides through the length of that sequence.
- Base-wise nucleotide distributions
- Calculates absolute coverages for the four DNA nucleotides (A, C, G or T) for each base position in the sequences.
- Base-wise GC-distribution
- Calculates absolute coverages of C's + G's for each base position in the sequences.
- Base-wise N-distribution
- Calculates absolute coverages of N's, for each base position in the sequences, where N refers to all ambiguous base-codes as specified by IUPAC.