Importing data on the fly

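There are two ways that raw data, i.e. data not already imported into the CLC software, can be imported as part of a workflow run:

- Include an Input element in the workflow and choose to import files on the fly when launching it.
- Include a dedicated Import element in the workflow, for example, an Illumina import element.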

Examples of these 2 design types are shown in figure 12.30. How these translate when launching the workflow is shown in figure 12.31. The relative merits of each option are outlined in table 12.1. For most uses, on-the-fly import will be the most versatile option.

Image workflow_input_elements_and_import
Figure 12.30: Raw data can be imported as part of a workflow run in 2 ways. Left: Include an Input element and use on-the-fly import. Right: Use a specific Import element. Here, the Illumina import element was included.

Image workflow_import_on_launch
Figure 12.31: Top: Launching a workflow with an Input element and choosing to select files to import on the fly. Bottom: Launching a workflow with a dedicated import element, in this case, an Illumina import element.


Table 12.1: Workflow import methods compared
| Functionality | Input element | Dedicated import element |
| --- | --- | --- |
| Running in batch mode | Supported. Check the Batch option in the launch wizard. | Not supported. (The Batch option is not visible in the launch wizard.) |
| Iterate elements | Supported. | Supported. |
| Choosing an importer when launching | Any available importer can be selected when launching. Use of already-imported data is also supported. Workflow authors can specify the importers available when launching. | Only data formats relevant for the specific importer can be selected for use. |
| Configuring import options | Options for all importers allowed by the workflow author can be configured, and set to be unlocked or locked. | Import options for the specific importer can be configured, and set to be unlocked or locked. |
| Saving imported elements | Not supported. The elements created during import are not saved. | Supported. If an Output element is attached to the Import element, the elements created during import can be saved. |
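
The comparison in table 12.1 can also be read as a simple decision aid. The sketch below is purely illustrative and is not part of any CLC API: the dictionary `CAPABILITIES`, its key names, and the function `suitable_designs` are hypothetical names introduced here. The values only restate the "Supported" and "Not supported" entries from the table.

```python
# Illustrative sketch only: this is NOT a CLC API.
# The names below are hypothetical; the True/False values restate table 12.1.
CAPABILITIES = {
    "Input element": {
        "batch_mode": True,                 # Batch option available in the launch wizard
        "iterate_elements": True,
        "choose_importer_at_launch": True,  # any importer the workflow author allows
        "configure_import_options": True,
        "save_imported_elements": False,    # elements created during import are not saved
    },
    "Dedicated import element": {
        "batch_mode": False,                # Batch option not visible in the launch wizard
        "iterate_elements": True,
        "choose_importer_at_launch": False, # fixed to the specific importer
        "configure_import_options": True,
        "save_imported_elements": True,     # requires an attached Output element
    },
}


def suitable_designs(*required):
    """Return the workflow design types that support every required capability."""
    known = set(next(iter(CAPABILITIES.values())))
    unknown = [name for name in required if name not in known]
    if unknown:
        raise KeyError(f"Unknown capability: {unknown}")
    return [
        design
        for design, caps in CAPABILITIES.items()
        if all(caps[name] for name in required)
    ]


if __name__ == "__main__":
    print(suitable_designs("batch_mode"))
    print(suitable_designs("save_imported_elements"))
    print(suitable_designs("batch_mode", "save_imported_elements"))
```

Under these assumptions, asking for both batch mode and saving of imported elements returns no design type, which matches the table: no single design type offers both.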


Notes:



Footnotes

Paired read handling for workflows launched in batch mode, or workflows with Iterate elements, is the same as for the importer tools themselves: if the Paired option is checked, files are handled as described in the manual section on NGS importers. In CLC Genomics Workbench 21.x, this was also the case in most circumstances. However, if batch units were based on data organization and all files were in the same folder, each file was treated as a separate batch unit, irrespective of whether the Paired option was checked.