Running multimodal workflows in batch using metadata

In order to run the workflow in batch, metadata must be provided to describe which DNA and RNA reads belong together.

See http://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Running_workflows_in_batch_mode.html for details on batch analysis, http://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Metadata.html for general information about metadata, and http://resources.qiagenbioinformatics.com/manuals/clcgenomicsworkbench/current/index.php?manual=Importing_metadata.html for how to import a metadata table.

Metadata can easily be supplied from an Excel spreadsheet or CSV file. A minimal spreadsheet should contain the start of the file names and a column containing the sample information:

Fastq Sample
DNA-S1 S1
RNA-S1 S1
DNA-S2 S2
RNA-S2 S2

Start the workflow normally, but remember to tick batch twice - once when selecting DNA reads (figure 16.11), and again when selecting RNA reads (figure 16.12).

Image multimodalbatchdnaselection
Figure 16.11: Both DNA samples are selected at the same time. They will be grouped into batches in a later step.

Image multimodalbatchrnaselection
Figure 16.12: Both RNA samples are selected at the same time. They will be grouped into batches in a later step.

Metadata can be chosen in the "Configure batching" dialog (figure 16.13).

Image multimodalmetadataexcel
Figure 16.13: Configuration of batch units based on metadata. Each batch unit is named after the DNA file name. DNA and RNA reads are grouped together if they share a value in the "Sample" column.

The next dialog will show how the batching will be performed (figure 16.14).

Image multimodalbatchoverview
Figure 16.14: Overview of the batch units. Each batch unit is named after the DNA file name. DNA and RNA reads are grouped together if they share a value in the "Sample" column.