An example using Illumina barcoded sequences
The data set in this example can be found at the Short Read Archive at NCBI. Use the Search for Reads in SRA... tool to search for SRX014012. Select the SRR03730 item and click Download Reads and Metadata. Save the sequence list in the Navigation Area, and use it with the Demultiplex Reads tool.
The barcoding was done using the following tags at the beginning of each read: CCT, AAT, GGT, CGT (see supplementary material of [Cronn et al., 2008]). The settings in the dialog should thus be as shown in figure 24.27.
Figure 24.27: Setting the barcode length at three.
Click next to the "Set barcode options" dialog and use the Add button) to specify the bar codes as shown in figure 24.28.
Figure 24.28: A preview of the result
With this data set we got the four groups as expected (shown in figure 24.29). The Not grouped list contains 445,560 reads that will have to be discarded since they do not have any of the barcodes.
Figure 24.29: The result is one sequence list per barcode and a list with the remainders