General notes on handling paired data

During import, information about the orientation of paired data is stored by the CLC Genomics Workbench. This means that all subsequent analyses will automatically take differences in orientation into account. Once imported, both reads of a pair will be stored in the same sequence list. The forward and reverse reads (e.g. for paired-end data) simply alternate so that the first read is forward, the second read is the mate reverse read; the third is again forward and the fourth read is the mate reverse read. When deleting or manipulating sequence lists with paired data, be careful not break this order.

You can view and edit the orientation of the reads after they have been imported by opening the read list in the Element information view (Image sequence_info) as shown in figure 6.12.

Image paired-orientation-view
Figure 6.12: The paired orientation and distance.

In the Paired status part, you can specify whether the CLC Genomics Workbench should treat the data as paired data, what the orientation is and what the preferred distance is. The orientation and preferred distance is specified during import and can be changed in this view.

Note that the paired distance measure that is used throughout the CLC Genomics Workbench is always including the full read sequence. For paired-end libraries it means from the beginning of the forward read to the beginning of the reverse read.