Storing, managing and moving reference data

QIAGEN reference data is downloaded to a CLC location called CLC_References. Data stored in a CLC_References location can only be deleted using functionality in the Reference Data Manager.

QIAGEN reference data download is covered in other sections, but in brief: it can downloaded via the Reference Data Manager, via launch wizards for workflows with input elements configured to use workflow roles and via the QIASeq Panel Analysis Assistant.

This section describes general functionality relating to managing reference data in CLC_References locations, including when working with a CLC Server and how to get download QIAGEN reference data intended for use on non-networked machines.

CLC_References in the Workbench

By default, a local CLC_References location refers to a folder of the same name in your home area. If that folder does not already exist when the CLC Genomics Workbench is first started up, it is created and added as a CLC Data Location.

The underlying folder CLC Data Locations are mapped to can be seen by hovering the mouse cursor over the location in the Navigation Area (figure 11.25).

Image ref_data_loc_mouseover_gwb
Figure 11.25: Hover the mouse cursor over a CLC_References File Location to see the folder it is mapped to on the file system. By default, this is a folder in your home area (top). When connected to a CLC Server with a CLC_References Location, the tooltip states that the location is on the server (bottom).

Specifying a different folder for reference data

The file system folder that a local CLC_References File Location should be mapped to can be configured by right-clicking the CLC_References location and choosing Location | Specify Reference Location (figure 11.26).

Image specify_new_ref_data_loc_gwb
Figure 11.26: To map a local CLC_References location to a different folder on the file system, right-click on CLC_References in the Navigation Area and select Location | Specify Reference Location.

Updating where the CLC_References File Location is mapped to does not remove the old CLC_References folder on the file system or its contents. Standard system tools should be used to delete these items if they are no longer needed.

CLC_References in a CLC Server setup

When the CLC Genomics Workbench is connected to CLC Genomics Server configured with a CLC Server File System Location called CLC_References, the option "On Server" is available in the Manage Reference Data drop-down list in the Reference Data Manager (figure 11.27).

When the "On Server" option is selected, the information shown in the Reference Data Manager refers to data stored in the CLC_References File System Location on the CLC Server, and data downloaded via the Reference Data Manager is downloaded to that location. By default, data is downloaded directly to the CLC Server, but downloads can be configured to go via the CLC Genomics Workbench instead using a setting in the Workbench Preferences. This can be useful if the CLC Server does not have access to the external network but the CLC Genomics Workbench does.

Image rdm-datadownload-onserver
Figure 11.27: The "On Server" option has been selected in the Reference Data Manager.

Copying reference data

Reference data can be copied from a CLC_References location in a CLC Workbench to a CLC_References location on a CLC Server or vice versa.

A button labeled Copy from WB will be visible when the selected data is available in the CLC_References area of your Workbench and you have selected the "On Server" option in the Reference Data Manager. Clicking on this button copies the data from the Workbench to the CLC Server CLC_References location.

Conversely, if you are working with a CLC_References location on your Workbench (i.e. the "Locally" option is selected in the Reference Data Manager) and you are connected to a CLC Server with a CLC_References location configured, a button labeled Copy from server will be present. Clicking on this copies the data to your Workbench CLC_References location from the CLC Server CLC_References location.

Copying data from other locations into a CLC_References location is described in Imported Data. Copying data from a CLC_References location to elsewhere on the file system is described in 

Reference data on non-networked systems

If the CLC Genomics Workbench is installed on systems without access to the external network, the following steps can be followed to import reference data to the non-networked Workbench:

  1. Run the CLC Genomics Workbench on a machine with access to the external network.

    If this is a new installation, either configure the relevant licensing details in the License Assistantor run the Workbench in Viewing Mode.

  2. Use the Reference Data Manager on the networked Workbench to download the reference data of interest. By default, this would be downloaded to a folder called CLC_References.

  3. When the download is completed, copy the CLC_References folder and all its contents to a location where the machines with the CLC software installed can access it.

  4. Get the software to refer to that folder for reference data: in the Navigation Area of the non-networked Workbench, right click on the CLC_References, and choose the option "Specify Reference Location...". Choose the folder you imported from the networked Workbench and click Select.



Subsections