HapTree

HapTree is a maximum-likelihood estimation framework for assembling the haplotypes of a single polyploid individual from next-generation sequencing (NGS) reads, enabling accurate phasing of higher-ploidy organisms.


Key Features:

  • Maximum-Likelihood Estimation Framework: HapTree formulates polyploid haplotype assembly as a maximum-likelihood estimation problem, tailored to the added complexity of higher ploidy levels.
  • Scalability and Efficiency: Scales haplotype assembly to triploid and higher-ploidy genomes, improving both assembly accuracy and computational efficiency.
  • Performance Evaluation with Simulated Illumina Reads: On simulated polyploid read data modeled after Illumina sequencing technologies, HapTree was more accurate and more efficient than existing methods for higher-ploidy genomes.
  • Validation on NA12878 (1000 Genomes Project): On real sequencing data from the diploid human sample NA12878, HapTree improved switch accuracy within phased haplotype blocks while producing minimum error correction (MEC) values comparable to those of existing methods.
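
The two evaluation metrics named above can be illustrated with a small sketch. This is not HapTree code: the data layout (reads as dictionaries from SNP index to observed allele, haplotypes as tuples of 0/1 alleles) and the function names are assumptions made for illustration. The MEC score counts the fewest allele flips needed to make every read consistent with some haplotype; the switch count shown here is the simple diploid form over heterozygous sites, and the polyploid accuracy measure used in the publication may be defined differently.

```python
from typing import Dict, List, Sequence

# Hypothetical layout: a read maps SNP index -> observed allele (0 or 1).
Read = Dict[int, int]


def mismatches(read: Read, haplotype: Sequence[int]) -> int:
    """Number of covered SNP sites where the read disagrees with the haplotype."""
    return sum(1 for pos, allele in read.items() if haplotype[pos] != allele)


def mec_score(reads: List[Read], haplotypes: List[Sequence[int]]) -> int:
    """Minimum error correction: each read is charged its distance
    to the closest haplotype, and the charges are summed."""
    return sum(min(mismatches(r, h) for h in haplotypes) for r in reads)


def switch_errors(predicted: Sequence[int], truth: Sequence[int]) -> int:
    """Diploid switch count over heterozygous sites: the number of adjacent
    site pairs where the predicted haplotype changes between agreeing and
    disagreeing with the true haplotype."""
    in_phase = [p == t for p, t in zip(predicted, truth)]
    return sum(1 for a, b in zip(in_phase, in_phase[1:]) if a != b)


# Toy triploid example: three haplotypes over four SNP sites.
haplotypes = [(0, 0, 1, 1), (1, 0, 0, 1), (1, 1, 0, 0)]
reads = [
    {0: 0, 1: 0, 2: 1},  # consistent with the first haplotype
    {1: 0, 2: 0, 3: 1},  # consistent with the second haplotype
    {0: 1, 1: 1, 3: 1},  # one mismatch against its closest haplotype
]
mec = mec_score(reads, haplotypes)

# Toy diploid example: the prediction flips phase once, after the second site.
switches = switch_errors(predicted=(0, 1, 1, 0, 0), truth=(0, 1, 0, 1, 1))
```

A lower MEC score means the assembled haplotypes explain the reads with fewer assumed sequencing errors, while a lower switch count means longer correctly phased stretches, so the two metrics can disagree on which assembly is better.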

Scientific Applications:

  • Plant Genomics: Enables haplotype assembly and phasing analysis in polyploid plant species where higher ploidy is common.
  • Yeast and Fish Genomics: Supports studies of genetic diversity and evolutionary processes in yeast and fish with non-diploid genomes.
  • Eukaryotic Evolution and (Epi)genetic Interaction Studies: Provides phased haplotypes that facilitate investigation of (epi)genetic interactions between gene copies and evolutionary dynamics in eukaryotes.

Methodology:

HapTree uses a maximum-likelihood estimation framework tailored for polyploid genomes. Its performance was assessed on simulated polyploid reads modeled after Illumina sequencing and validated on NA12878 (1000 Genomes Project) sequencing data.
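
To make the idea of maximum-likelihood phasing concrete, the following minimal sketch scores candidate haplotype sets by the likelihood of the observed reads and keeps the best one. It is an illustration only and does not reproduce HapTree's model or search strategy: the uniform choice of source haplotype, the single per-base error rate, the explicit list of candidates, and all function names are assumptions of this sketch.

```python
import math
from typing import Dict, List, Sequence

# Hypothetical layout: a read maps SNP index -> observed allele (0 or 1).
Read = Dict[int, int]
HaplotypeSet = List[Sequence[int]]


def read_likelihood(read: Read, haplotypes: HaplotypeSet, error_rate: float) -> float:
    """P(read | haplotypes), assuming the read comes from each haplotype with
    equal probability and each base is miscalled independently."""
    total = 0.0
    for hap in haplotypes:
        p = 1.0
        for pos, allele in read.items():
            p *= (1.0 - error_rate) if hap[pos] == allele else error_rate
        total += p / len(haplotypes)
    return total


def log_likelihood(reads: List[Read], haplotypes: HaplotypeSet,
                   error_rate: float = 0.01) -> float:
    """Sum of per-read log-likelihoods (reads treated as independent)."""
    return sum(math.log(read_likelihood(r, haplotypes, error_rate)) for r in reads)


def best_phasing(reads: List[Read], candidates: List[HaplotypeSet],
                 error_rate: float = 0.01) -> HaplotypeSet:
    """Return the candidate haplotype set with the highest log-likelihood."""
    return max(candidates, key=lambda h: log_likelihood(reads, h, error_rate))


# Toy triploid example with two SNP sites, each carrying one alternate allele.
# The question is whether the two alternate alleles share a haplotype.
same_copy = [(1, 1), (0, 0), (0, 0)]
different_copies = [(1, 0), (0, 1), (0, 0)]

reads = [{0: 1, 1: 1}] * 3 + [{0: 0, 1: 0}] * 5
chosen = best_phasing(reads, [same_copy, different_copies])
```

In this toy data, reads that carry both alternate alleles together are likely only if those alleles sit on the same chromosome copy, so the first candidate is preferred. Enumerating candidates explicitly does not scale beyond a few sites, which is the kind of cost a scalable polyploid method has to avoid.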

Details

Tool Type:
command-line tool
Operating Systems:
Linux, Mac
Programming Languages:
Python
Added:
8/3/2017
Last Updated:
11/25/2024

Publications

Berger E, Yorukoglu D, Peng J, Berger B. HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data. PLoS Computational Biology. 2014;10(3):e1003502. doi:10.1371/journal.pcbi.1003502. PMID:24675685. PMCID:PMC3967924.
