Data structures for transcriptomics
The two main data structures used for transcriptomics data analysis in the CLC Main Workbench are tracks and experiments.
The track format can be used to visualize and analyze data in the CLC Main Workbench. All information is tied to genomic positions, and a reference genome provides the central coordinate system. This allows different types of data, or results for different samples, to be seen and analyzed together.
Experiments (see Experiments), on the other hand, are used to represent complex relationships between expression samples, and to carry out statistical analysis (see statistical analysis) of differential expression.
Tracks and experiments are intimately related, and it is possible in most cases to convert from one type to the other.
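The relationship between the two structures can be illustrated with a small conceptual sketch in Python. This is not the CLC Main Workbench's internal representation or API. The names TrackFeature, tracks_to_experiment and experiment_to_track, as well as the sample data, are hypothetical and chosen only to show how values tied to genomic positions (a track) can be rearranged into a feature-by-sample table (an experiment) and back.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrackFeature:
    """One entry of a hypothetical expression track: a value tied to a genomic region."""
    name: str
    chromosome: str
    start: int
    end: int
    expression: float


def tracks_to_experiment(tracks):
    """Combine per-sample tracks into a table: feature name -> {sample -> expression}."""
    experiment = {}
    for sample, features in tracks.items():
        for feature in features:
            experiment.setdefault(feature.name, {})[sample] = feature.expression
    return experiment


def experiment_to_track(experiment, sample, regions):
    """Rebuild one sample's track from the table, given each feature's genomic region."""
    track = []
    for name, values in experiment.items():
        chromosome, start, end = regions[name]
        track.append(TrackFeature(name, chromosome, start, end, values[sample]))
    return track


# Made-up example data: two samples measured on the same two genes.
tracks = {
    "sample_A": [
        TrackFeature("geneX", "chr1", 100, 500, 12.0),
        TrackFeature("geneY", "chr2", 50, 900, 3.5),
    ],
    "sample_B": [
        TrackFeature("geneX", "chr1", 100, 500, 8.0),
        TrackFeature("geneY", "chr2", 50, 900, 7.25),
    ],
}

experiment = tracks_to_experiment(tracks)
regions = {f.name: (f.chromosome, f.start, f.end) for f in tracks["sample_A"]}
rebuilt = experiment_to_track(experiment, "sample_B", regions)
```

In this sketch the experiment keeps only names and values, so converting back to a track requires the genomic regions to be supplied separately. That is one way to picture why the shared reference genome matters for the conversion.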
The starting point is a set of reads from a sequencing study. These reads can be analyzed with the RNA-Seq Analysis tool in the CLC Genomics Workbench or the Biomedical Genomics Workbench, and the results imported into the CLC Main Workbench. As part of the RNA-Seq Analysis tool, the reads are mapped onto a reference genome. The tool produces expression tracks, which are compatible with the reference genome and can be visualized together with the genome in the Track List View.
Once expression tracks have been obtained from the RNA-Seq Analysis tool, they can be used as sequencing-based sets of expression values when setting up an experiment. This is done with the Set up Experiment tool, which is described in more detail in Experiments.
An experiment set up in this manner from expression tracks is intimately coupled to the tracks it originated from. To see this coupling in action, perform the following steps:
- Use the Set up Experiment tool on two or more expression tracks to set up an experiment, as described in Experiments.
- Save and open the resulting experiment by double-clicking its name in the Navigation Area.
- Use the Create Track List tool to create a track list from the expression tracks you used to set up the experiment.
- Save and open the resulting track list by double-clicking its name in the Navigation Area.
- Drag the experiment tab downwards, until you see the blue shadow indicating the resulting placement (figure 22.19), and drop it in place. You should now have a divided view, with the experiment in the bottom half (figure 22.20).
- Clicking any line in the experiment now makes the upper view jump automatically to the corresponding genomic location. Use the Zoom to Selection button to zoom in on the desired genomic region.
Figure 22.19: Dragging a tab to the lower half of the view area.
Figure 22.20: After dropping a tab to the lower half of the view area.
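The coupling demonstrated in the steps above, where selecting a line in the experiment moves the track view to the matching genomic location, can be pictured as a simple lookup. The following Python sketch is only an illustration of that idea and not how the workbench implements it. Region, region_for_row, view_window and the padding parameter are hypothetical names, and the coordinates are made up.

```python
from typing import NamedTuple, Optional


class Region(NamedTuple):
    """A genomic interval on a reference sequence (hypothetical helper type)."""
    chromosome: str
    start: int
    end: int


def region_for_row(row_name, regions) -> Optional[Region]:
    """Return the genomic region linked to an experiment line, or None if unknown."""
    return regions.get(row_name)


def view_window(region, padding=0):
    """Widen a region by `padding` bases on each side, never going below position 1."""
    return Region(
        region.chromosome,
        max(1, region.start - padding),
        region.end + padding,
    )


# Made-up mapping from experiment lines (feature names) to reference coordinates.
regions = {
    "geneX": Region("chr1", 100, 500),
    "geneY": Region("chr2", 50, 900),
}

selected = region_for_row("geneY", regions)   # the line the user clicked
window = view_window(selected, padding=100)   # the region the upper view would show
```

The lookup works only because both the experiment lines and the tracks refer to the same reference genome coordinates, which is the coupling the steps above demonstrate.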