Copy Number Variant Detection

The Copy Number Variant Detection tool is designed to detect copy number variations (CNVs) from targeted resequencing experiments. The current release is a beta version.

The tool takes read mappings and target regions as input, and produces amplification and deletion annotations. The annotations are generated by a "depth-of-coverage" method, where the target-level coverages of the case and the controls are compared in a statistical framework. The algorithm implemented in the Copy Number Variant Detection tool is inspired by the following papers:

The Copy Number Variant Detection tool identifies CNVs regions where the normalized coverage is statistically significantly different from the controls.

The algorithm carries out the analysis in several steps.

  1. Base-level coverages are analyzed for all samples, and a robust coverage baseline is generated using the control samples.
  2. Chromosome-level coverage analysis is carried out on the case sample, and any chromosomes with unexpectedly high or low coverages are identified.
  3. Sample coverages are normalized, and a global, target-level statistical model is set up for the variation in fold-change as a function of coverage in the baseline.
  4. Each chromosome is segmented into regions of similar fold-changes.
  5. The expected fold-change variation in region is determined using the statistical model for target-level coverages. Region-level CNVs are identified as the regions with fold-changes significantly different from 1.0.
  6. If chosen in the parameter steps, gene-level CNV calls are also produced.