Contra

Contra detects copy number variations (CNVs) from targeted resequencing data, including whole-exome capture, to identify copy number gains and losses relevant for genetic studies of disease.


Key Features:

  • CNV detection: Identifies copy number gains and losses across target regions by analyzing normalized depth of coverage from sequencing data.
  • GC-content bias removal: Mitigates GC-content bias using base-level log-ratios.
  • Library size effect correction: Corrects for imbalances in library size that affect log-ratio calculations.
  • Log-ratio estimation: Estimates log-ratio variations through binning and interpolation to smooth noise and refine copy-number estimates.
  • File format integration: Accepts BAM/SAM alignments and outputs CNV calls in VCF 4.0.
  • Targeted resequencing support: Operates on targeted resequencing datasets, including whole-exome capture and other target enrichment assays.

Scientific Applications:

  • CNV discovery in targeted studies: Detects CNVs from targeted resequencing and whole-exome capture datasets for studies of genetic variation.
  • Disease-associated variant analysis: Supports identification of genetic variations linked to disease by reporting copy number gains and losses.
  • Benchmarking and validation: Has been evaluated using samples from seven target enrichment assays, simulations, and real germline data with known CNV genotypes.

Methodology:

Analyzes normalized depth of coverage from BAM/SAM, computes base-level log-ratios, applies GC-content and library-size corrections, performs binning and interpolation for log-ratio estimation, and outputs CNV calls in VCF 4.0.

Topics

Details

License:
GPL-3.0
Maturity:
Mature
Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
R, Python
Added:
1/13/2017
Last Updated:
11/24/2024

Operations

Publications

Li J, Lupat R, Amarasinghe KC, Thompson ER, Doyle MA, Ryland GL, Tothill RW, Halgamuge SK, Campbell IG, Gorringe KL. CONTRA: copy number analysis for targeted resequencing. Bioinformatics. 2012;28(10):1307-1313. doi:10.1093/bioinformatics/bts146. PMID:22474122. PMCID:PMC3348560.

Documentation