Configuring Workflow Input elements

Workflow Input elements are used to provide data to a workflow. At least one such element is required in a workflow for supplying the data for analysis. Input elements can be connected to any input channel expecting data.

When there is more than one Input element in a workflow, the order that data is prompted for in the launch wizard can be configured (see Ordering inputs).

For information about configuring reference data inputs in workflows, see Reference data and workflows. The use of the "Workflow role" option, available in tool input channels and in Input elements where reference data is expected is described there. Information about launching workflows where workflow roles have been defined is described in Specifying reference data in the launch wizard.

Configuring Input element options

By default, Input elements are configured to allow data to be selected from CLC locations listed in the Navigation Area, or from elsewhere, in which case, the data will be imported using on-the-fly import. The workflow author can limit these options (figure 14.38), as well as configure import options with non-default values. Settings can be locked if they should not be configurable when launching the workflow (see Basic configuration of workflow elements).

On-the-fly import options that can be configured in Input elements are:

Note: To specify CLC data not stored in a CLC location as input to a workflow, on-the-fly import must be allowed, and CLC Format must be one of the allowed importers. See also Launching workflows individually and in batches.

Image configure-workflow-input-element-genomics
Figure 14.38: The allowed data sources are configured in Input elements. By default, both checkboxes in the Advanced section are enabled, allowing data to be selected from a CLC location (input from the Navigation Area), or from another location (on-the-fly import). For on-the-fly import, any available importer can be used by default. When only specific importers are allowed, those importers can be configured by selecting each in turn and clicking on the "Configure Parameters" button. In this case, only the data types supported by the specified importers, can be selected as input for on-the-fly import when launching the workflow.

On-the-fly import versus using workflow elements for specific importers

For importing data as the first step of a workflow, on-the-fly importer, as described above, is the most flexible and commonly used option. However, workflow input elements for specific NGS importers are also available.

Examples using each of these options are shown in figure 14.39. How these translate when launching the workflow is shown in figures 14.40 and 14.41. The relative merits of each option are outlined in table 14.1.

Image workflow_input_elements_and_import
Figure 14.39: Raw data can imported as part of a workflow run in 2 ways. Left: Include an Input element. and use on-the-fly import. Right: Use a specific Import element. Here, the Illumina import element was included.

Image workflow_import_on_launch_on_the_fly
Figure 14.40: Launching a workflow with an Input element and choosing to select files to import on-the-fly.

Image workflow_import_on_launch_dedicated_importer
Figure 14.41: Launching a workfow with a dedicated import element, in this case, an Illumina import element.


Table 14.1: Workflow import methods compared
Functionality Input element Dedicated import element
Running in batch mode Supported.
Check the Batch option in the launch wizard.
Not supported.
(The Batch option is not visible in the launch wizard).
Iterate elements Supported. Supported.
Choosing an importer when launching Any available importer can be selected when launching. Use of already-imported data is also supported. Workflow authors can specify the importers available when launching. Only data formats relevant for the specific importer can be selected for use.
Configuring import options Options for all importers allowed by the workflow author can be configured, and set to be unlocked or locked. Import options for the specific importer can be configured,and set to be unlocked or locked.
Saving imported elements Supported.
To save the imported elements, connect a Save On-the-fly Imports element to the Input element, and connect its output to an Output element.
Supported.
To save the imported elements, connect an Output element the Import element.


Notes:

Saving data imported on-the-fly

To save data elements created using on-the-fly import, connect a Save On-the-Fly Imports (Image optional_output_16_n_p) element to the relevant Input element and connect an Output element to it (figure 14.42).

The Save On-the-Fly Imports element only acts on data imported using on-the-fly import. If input data is selected from a CLC File Location, this element has no effect.

Image save-on-the-fly
Figure 14.42: Using this workflow, data imported on-the-fly would be saved as an output from the Save On-the-Fly Imports element.

A Save On-the-Fly Imports element is not needed when an Iterate element is connected to the Input element being used for on-the-fly import. In this situation, an Output element can be connected directly to the Iterate element (figure 14.43).

Image save-on-the-fly-import-from-iterate
Figure 14.43: When an Iterate element is connected to an Input element, data imported on-the-fly can be saved by connecting an Output element to the Iterate element.



Subsections