Tracks

The primary outputs of the Transcript Discovery tool are a gene and a transcript tracks containing all "known" genes/transcripts (if these were supplied as input), as well as some "unknown" genes/transcripts (i.e., those that we have predicted, unless none were predicted). Note that by definition, every unknown gene/transcript present in the track is detected, while "known" genes/transcripts may or may not be detected.

Predicted gene track

A Predicted gene track viewed a table is shown in figure 3.7.

Image predictedgenetable
Figure 3.7: A Predicted gene track seen as a table, and showing "known" and "unknown" genes.

The column headers of the Predicted gene table are:

Unknown genes are given names of the form Gene_1, Gene_2 etc. In order to avoid name clashes with previous predictions, the algorithm checks the previously annotated transcripts for genes with names of this form. New predictions will then be output with the next available index. For example, if a "Gene_11" is already present in the previously annotated transcripts, then the first new gene will be Gene_12.

These annotations sometimes include the sample name, such that if the tool is run on "Sample A", then the output transcript track will have table columns of the form "Coverage "Sample A"". This new track can then be supplied as input when the tool is run on "Sample B". The output transcript track will then have table columns "Coverage "Sample A"" and "Coverage "Sample B"".

Predicted transcript track

Similarly a transcript track contains all the "known" input transcripts plus "unknown" transcripts if any were detected (figure 3.8).

Image predictedtranscripttable
Figure 3.8: A Predicted transcript track seen as a table, and showing "known" and "unknown" transcripts.

For detected transcripts, the following annotations are available:

Predicted CDS track

Predicted CDS tables have, on top of some of the column headers described above, the following ones:

Note that uncertainty in whether an ORF might start before the first observed start codon can be seen by the use of an "open" position, for example <130 for an ORF that is annotated at position 130, but may actually begin earlier in the reference figure 3.9

Image eventsorf
Figure 3.9: A jagged line at the beginning of the annotation shows that the ORF may start before the first observed start codon.

Accepted events track

Accepted events tables have, on top of some of the column headers described above, the following ones:

Rejected events track

Rejected events tables have, on top of some of the column headers described above, the following ones:

Image someRejectedEvents
Figure 3.10: Rejected events' Names as seen in the track view of the Rejected events track.

The Name describing the filter causing an event to be rejected also refers to which filter can be adjusted in a second iteration of the tool to report the event in the Accepted events track:

However, there are no configurable filters to change the rejection of the following events:

Track list

It can be very handy to see annotations reflecting the various steps in the transcript discovery in a Track list. An example is shown in figure 3.11

Image events2
Figure 3.11: A track list showing the read mapping along with the Accepted events track and the Predicted gene track.