Ohana
Ohana infers individual ancestry components and estimates population trees from genotype data using a maximum likelihood optimization of the STRUCTURE model while handling genotype likelihoods from NGS data.
Key Features:
- Optimization Algorithm for STRUCTURE Model: Implements a novel optimization algorithm tailored to the classical STRUCTURE model within a maximum likelihood framework, achieving higher likelihood solutions with comparable computational efficiency.
- Genotype Likelihoods Integration: Incorporates genotype likelihoods to account for genotype-calling uncertainty associated with next-generation sequencing (NGS) data.
- Population Tree Estimation: Employs a Gaussian approximation to estimate population trees from inferred ancestry components and visualize evolutionary relationships among populations.
- Simulation-Based Validation: Validated using coalescence simulations of diverging populations to assess accuracy of ancestry component inference and population tree topologies.
- Visualization Tools: Provides visualization for interpreting inferred ancestry components and population tree structures.
Scientific Applications:
- Fractional Ancestry Classification: Assigns individuals fractionally into discrete ancestry components.
- Admixture Inference: Infers historical admixture events and estimates their timelines.
- Phylogenetic Reconstruction: Constructs population trees reflecting genetic divergence among populations.
Methodology:
Optimization of the STRUCTURE model within a maximum likelihood framework, handling of genotype likelihoods from NGS data, population tree estimation via a Gaussian approximation, and validation through coalescence simulations.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Mac
- Programming Languages:
- C++
- Added:
- 6/4/2018
- Last Updated:
- 11/25/2024
Operations
Publications
Cheng JY, Mailund T, Nielsen R. Fast admixture analysis and population tree estimation for SNP and NGS data. Bioinformatics. 2017;33(14):2148-2155. doi:10.1093/bioinformatics/btx098. PMID:28334108. PMCID:PMC6543773.
PMID: 28334108
PMCID: PMC6543773
Funding: - Danish Council of Independent Research Sapere Aude: 12-125062