Zinfandel

Zinfandel detects copy number variants (CNVs) from low-pass whole genome sequencing (WGS) data to identify medium-size deletions (200–2000 base pairs) relevant to disease and population genomics.


Key Features:

  • Low-Pass Whole Genome Sequencing Compatibility: Operates on low-depth WGS data, typically below 10× coverage per sample.
  • Hidden Markov Model Integration: Uses a Hidden Markov Model that jointly incorporates depth of coverage and mate-pair relationship data.
  • Mate-Pair Relationship Utilization: Infers deletion likelihood by jointly analyzing multiple mate pairs within a region rather than relying on single outlier pairs, improving detection of medium-size deletions (200–2000 bp) at low depth.
  • Optimal Power and Resolution: Calibrated to achieve improved power and resolution for CNV detection across varying experimental designs.

Scientific Applications:

  • Disease Genomics: Identification of CNVs associated with genetic disorders and elucidation of their contributions to disease phenotypes.
  • Population Genetics: Analysis of genomic diversity and population-specific CNV patterns to inform evolutionary and epidemiological studies.

Methodology:

Applies a Hidden Markov Model that integrates depth of coverage and mate-pair relationship data from low-pass WGS and jointly infers deletion likelihood across multiple mate pairs to detect medium-size deletions (200–2000 bp) without relying on single outlier pairs.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
Java
Added:
8/3/2017
Last Updated:
11/25/2024

Operations

Publications

Shen Y, Gu Y, Pe’er I. A Hidden Markov Model for Copy Number Variant prediction from whole genome resequencing data. BMC Bioinformatics. 2011;12(S6). doi:10.1186/1471-2105-12-s6-s4. PMID:21989326. PMCID:PMC3194192.

Documentation

Links