System requirements

The system requirements of CLC Server are:

Server system

Server hardware requirements

Performance note: Tools that take advantage of multiple cores do not scale linearly with high numbers of cores. If you plan to use a large system (>64 cores), a setup with job nodes running on virtual machines provides the potential to use more of the compute capacity. On a job node setup, analyses can be run in parallel, with appropriate cpu limits configurable for each node, as described in Configuring your job node setup.

Memory and CPU settings for mapping reads

For mapping reads to the human genome ( 3.2 gigabases), or genomes of a similar size, 16 GB RAM is required. Smaller systems can be used when mapping to small genomes.

Larger amounts of memory can help the overall speed of the analysis when working with large datasets, but little gain is expected above about 32 GB of RAM.

Increasing the number of cpus can decrease the time a read mapping takes, however performance gain is expected to be limited above approximately 40 threads.

System requirements for de novo assembly

De novo assembly may need more memory than stated above - this depends both on the number of reads, error profile and the complexity and size of the genome.

When assembling PacBio HiFi reads with the De Novo Assemble Long Reads tool, at least 32 GB RAM is recommended.

For examples of the memory usage of various data sets when using the De Novo Assembly tool, see https://resources.qiagenbioinformatics.com/white-papers/White_paper_on_de_novo_assembly_4.pdf.

Other requirements for long read analysis

The following tools for working with long reads require an AMD/Intel CPU that supports AVX2 or an Apple M series CPU:

Polish Contigs with Reads and Correct Long Reads (legacy) use Racon [Vaser et al., 2017], for which the following is noted: Memory consumption of polishing equals the size of the mandatory input files, while the memory is somewhat larger for error correction. See https://complex.zesoi.fer.hr/en/blog-en/58-racon-1-0-release.

Special requirement for the shared filesystem used by the job node setup or grid integration

The file locking mechanism is required to ensure that all nodes see the latest version of the data stored on the shared filesystem.

Special requirements for containerized external applications

Containerized external applications1.1 are supported for Unix images run on Unix hosts only. Windows-based images and Windows hosts are not supported. Standard external applications are supported for any CLC Server setup.



Footnotes

... applications1.1
Containerized external applications were introduced in CLC Server 21.0.