In order to run the workflow in batch, metadata must be provided to describe which DNA and RNA reads belong together. This metadata can easily be supplied from an Excel spreadsheet. A minimal spreadsheet is shown in figure 8.6
. It contains the start of the sample names and a column containing pairing information.
Figure 8.6: A minimal spreadsheet for running two multimodal samples, each consisting of a DNA and an RNA part. Here, we aim to run the two S1 samples in one execution of the workflow, and to run the two S2 samples in a second execution.
Start the workflow normally, but remember to tick batch twice - once when selecting DNA reads (figure 8.7), and again when selecting RNA reads (figure 8.8).
Figure 8.7: Both the input DNA samples are selected. They will be grouped into batches in a later step.
Figure 8.8: Both the input RNA samples are selected. They will be grouped into batches in a later step.
When this is done, a "Configure batching" dialog will ask for metadata. Figure 8.6 shows how to configure this:
Figure 8.9: Configuration of batch units based on metadata. Each batch unit is named after the DNA sample name. DNA and RNA reads are grouped together if they share a value in the "Pair" column.
The next dialog will show you how the batching will be performed figure 8.10.
Figure 8.10: Overview of the batch units. Each batch unit is named with the DNA sample name. DNA and RNA reads are grouped together if they share a value in the "Pair" column.