polyHap

polyHap infers haplotypes in polyploid organisms and genomic regions with copy number variations (CNVs), enabling accurate phasing and supporting haplotype-based analyses.


Key Features:

  • HMM with sampling algorithm: Uses a hidden Markov model (HMM) combined with a sampling algorithm to model phase and provide probabilistic inference.
  • Joint haplotype inference: Performs joint inference of haplotypes across multiple individuals.
  • Uncertainty quantification: Provides a quantifiable measure of uncertainty for phased haplotypes.
  • Variable ploidy and CNV support: Handles varying ploidy levels across individuals while assuming constant ploidy within analyzed genomic regions and phases regions with copy number variations (CNVs).
  • Supported ploidy levels: Manages diploid, triploid, and tetraploid genotypes.
  • Reduced phasing and imputation errors: Demonstrates reduction of switch errors and imputation errors for missing genotypes in evaluations.
  • Simulation-based validation: Validated using simulations that generate artificial polyploid genotypes from real haplotype data.
  • Comparative performance: Shown to outperform fastPhase for diploids and SATlotyper for tetraploids in densely genotyped regions.

Scientific Applications:

  • Haplotype-based association studies: Enables haplotype-based association analyses in polyploid organisms and CNV regions.
  • Selection scans: Facilitates identification of genomic regions under selection using phased haplotypes.
  • Ancestral inference: Supports ancestral inference through reconstructed haplotypes.
  • Dense SNP dataset analysis in polyploids: Allows analysis of large SNP datasets from polyploid organisms and CNV regions.

Methodology:

Uses a hidden Markov model (HMM) combined with a sampling algorithm to jointly infer haplotypes across multiple individuals and provide measures of uncertainty.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux, Windows, Mac
Programming Languages:
Java
Added:
12/18/2017
Last Updated:
11/25/2024

Operations

Publications

Su S, White J, Balding DJ, Coin LJ. Inference of haplotypic phase and missing genotypes in polyploid organisms and variable copy number genomic regions. BMC Bioinformatics. 2008;9(1). doi:10.1186/1471-2105-9-513. PMID:19046436. PMCID:PMC2647950.

Documentation

Links