Sanger sequencing data

The Sanger High-Throughput Sequencing Data Import tool is designed to handle large volumes of Sanger data. Supported formats are ab, abi, ab1, scf and phd. Gzip-compressed data (.gz) is also supported.
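As an illustration only, the format list above can be used to check file names before starting an import. The sketch below is not part of the import tool. The function name is hypothetical, and the assumptions that matching is case-insensitive and that a .gz suffix wraps one of the listed extensions are ours, not statements from this manual.

```python
from pathlib import Path

# Extensions listed in the text above.
# Case-insensitive matching is an assumption of this sketch.
SANGER_EXTENSIONS = {".ab", ".abi", ".ab1", ".scf", ".phd"}


def is_supported_sanger_file(filename: str) -> bool:
    """Return True if the file name ends in a listed Sanger extension,
    optionally followed by .gz (hypothetical helper, not a tool API)."""
    suffixes = [s.lower() for s in Path(filename).suffixes]
    # Drop a trailing .gz so the underlying extension can be checked.
    if suffixes and suffixes[-1] == ".gz":
        suffixes = suffixes[:-1]
    return bool(suffixes) and suffixes[-1] in SANGER_EXTENSIONS


if __name__ == "__main__":
    for name in ["plate1_A01.ab1", "plate1_A02.scf.gz", "reads.fastq"]:
        print(name, is_supported_sanger_file(name))
```

A check like this only inspects file names. It does not verify that the file content is valid trace data.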

Sanger sequencing data can also be imported using the standard Import tool (see Import bioinformatics data). The key differences between the high throughput importer and the standard importer are:

Sanger data can also be imported during a workflow run using on-the-fly import, as described in Launching workflows individually and in batches. Both the standard importer ("Trace files") and the high throughput importer ("Sanger") are available for on-the-fly import.

The configuration step when using the high throughput Sanger importer is shown in figure 7.15.

Figure 7.15: Selecting input and configuring a high throughput Sanger import

Configuring the import:

The next wizard step provides some options for handling the results. When the option to "Create subfolders per batch unit" is enabled, each sequence list created will be put into its own subfolder. This can be helpful for running analyses in batches and for organizing the results of subsequent analyses.