Options for clc_remove_duplicates
usage: clc_remove_duplicates <options>
Remove duplicate reads that originate from the sequencing process.
Options:
-h / --help: Display this help
-r [-i] <file> [<file2>] / --input [-i] <file> [<file2>]: Input read file(s). (Required)
-q [-i] <file> [<file2>] / --quality [-i] <file> [<file2>]: Specify separate input quality file(s).
-o <file> / --outputfile <file>: Set the output read file with duplicates removed. (Required)
-d <file> / --duplicatesfile <file>: Set the output read file containing only the duplicates.
-s <file> / --statisticsfile <file>: Set the output file for the distribution of duplicates.
-p / --paired: The data are paired.
-c / --colorspace: The data are from color space sequencing.
-m <n> / --memory <n>: Set the maximum amount of memory to use as a fraction
of the available memory (default is 1.0).
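
The options above can be assembled into a command line programmatically. The following is a minimal sketch, not part of the tool: the helper name build_remove_duplicates_command and its parameters are hypothetical, only short flags listed above are used, and the order in which the flags are emitted is an assumption.

```python
from typing import List, Optional


def build_remove_duplicates_command(
    reads: List[str],
    output: str,
    paired: bool = False,
    duplicates: Optional[str] = None,
    statistics: Optional[str] = None,
    memory_fraction: Optional[float] = None,
) -> List[str]:
    """Build an argument list for clc_remove_duplicates (hypothetical helper)."""
    # -r and -o are the two options marked (Required) in the help text.
    if not reads:
        raise ValueError("at least one input read file is required (-r)")
    if not output:
        raise ValueError("an output file is required (-o)")

    cmd = ["clc_remove_duplicates"]
    if paired:
        cmd.append("-p")  # the data are paired
    # Flag order is an assumption; the help text does not specify one.
    cmd += ["-r", *reads, "-o", output]
    if duplicates is not None:
        cmd += ["-d", duplicates]  # reads that were identified as duplicates
    if statistics is not None:
        cmd += ["-s", statistics]  # distribution of duplicates
    if memory_fraction is not None:
        cmd += ["-m", str(memory_fraction)]  # fraction of available memory
    return cmd


if __name__ == "__main__":
    # File names here are placeholders for illustration only.
    print(" ".join(build_remove_duplicates_command(
        ["reads.fq"], "dedup.fq", paired=True, statistics="stats.txt")))
```

The returned list can be passed to subprocess.run on a machine where the tool is installed; the sketch itself never runs the tool.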
The default output format is fasta. However, if an output file name ends with ".fastq" or ".fq",
the program writes a fastq file containing sequences and quality scores.
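
The extension rule above can be checked before running the tool. This is a minimal sketch of that rule only: the function name expected_output_format is hypothetical, and the case-sensitive match is an assumption, since the help text does not say how upper-case extensions are treated.

```python
def expected_output_format(path: str) -> str:
    """Return the format the help text describes for a given output file name."""
    # Assumption: the match is case-sensitive, so "reads.FQ" counts as fasta here.
    if path.endswith((".fastq", ".fq")):
        return "fastq"
    # Any other name falls back to the documented default.
    return "fasta"


if __name__ == "__main__":
    for name in ("dedup.fq", "dedup.fastq", "dedup.fa", "dedup"):
        print(name, expected_output_format(name))
```

A check like this helps catch the case where quality scores are needed downstream but the chosen output name would produce fasta.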
