CNVnator

CNVnator detects and genotypes copy number variations (CNVs) from whole-genome sequencing data using read-depth analysis.


Key Features:

  • Read-depth analysis: Uses read-depth from personal/whole-genome sequencing to discover and genotype CNVs.
  • Mean-shift segmentation: Employs a refined mean-shift approach with multiple-bandwidth partitioning to identify CNV regions.
  • GC-content correction: Applies GC content correction to adjust read-depth signals for sequence composition bias.
  • Calibration and validation: Calibrated and validated using validation data from the 1000 Genomes Project.
  • Performance metrics: Reports sensitivity of 86%–96%, false-discovery rate of 3%–20%, genotyping accuracy of 93%–95%, and breakpoint resolution typically <200 base pairs in 90% of cases with sufficient sequencing coverage.
  • Complementary detection: Identifies over half of validated CNVs missed by split-read and read-pair methods while not detecting CNVs generated by retrotransposable elements.
  • Population genotyping: Genotypes CNVs across CEPH, Yoruba, and Chinese-Japanese cohorts and identifies multi-allelic loci.
  • De novo and multi-allelic CNV detection: Detects de novo and multi-allelic CNVs and contributed to the identification of six potential de novo CNVs in two family trios.

Scientific Applications:

  • Population studies: Characterizes CNV distributions and genetic diversity within and between populations.
  • Characterization of atypical CNVs: Detects de novo and multi-allelic CNVs and complex loci for downstream analysis.
  • Genetic research: Enables genotyping across CEPH, Yoruba, and Chinese-Japanese groups and supports estimates that at least 11% of CNV loci are multi-allelic.
  • Evolutionary analysis: Detects deviations from Hardy-Weinberg equilibrium at complex loci, indicating potential selection pressures.
  • Familial studies: Supports discovery of potential de novo CNVs within family trios for hereditary investigations.

Methodology:

Performs read-depth analysis using a refined mean-shift algorithm with multiple-bandwidth partitioning and GC-content correction, calibrated and validated against 1000 Genomes Project data, and reports breakpoint localization typically <200 base pairs with sufficient sequencing coverage.

Topics

Collections

Details

License:
Other
Tool Type:
command-line tool
Programming Languages:
C++
Added:
1/13/2017
Last Updated:
11/25/2024

Operations

Publications

Abyzov A, Urban AE, Snyder M, Gerstein M. CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Research. 2011;21(6):974-984. doi:10.1101/gr.114876.110. PMID:21324876. PMCID:PMC3106330.

Documentation