ADMIXPIPE
ADMIXPIPE automates preprocessing, SNP filtering, and result summarization for ADMIXTURE-based inference of population structure from VCF datasets, including reduced-representation sequencing data such as ddRAD.
Key Features:
- VCF parsing and filtering: Parses Variant Call Format (VCF) files and applies filters to prepare genotype data for downstream analyses.
- SNP filtering: Implements single nucleotide polymorphism (SNP) filtering to select loci appropriate for population-structure inference.
- ADMIXTURE integration: Runs ADMIXTURE (maximum-likelihood framework) analyses, including replicated runs.
- Optimal K inference: Infers the optimal number of population clusters (K) from replicated ADMIXTURE runs.
- CLUMPAK integration: Summarizes and provides graphical representation of ADMIXTURE run variation via CLUMPAK integration.
- Parallel processing: Supports parallel execution to improve efficiency on large population genomic datasets.
- Support for reduced-representation data: Tailored to handle datasets generated by protocols such as double digest RAD (ddRAD).
Scientific Applications:
- Population structure inference: Infers genetic population structure using ADMIXTURE-based analyses.
- Analyses of non-model organisms: Processes genomic data from organisms lacking complete reference genomes, including ddRAD-derived datasets.
- Admixture pattern characterization: Identifies and summarizes admixture patterns across replicated analyses to aid selection of K.
- Genetic diversity and evolutionary relationships: Supports studies of genetic diversity and evolutionary relationships within and among populations.
- Large-scale population genomics: Enables processing and analysis of large population genomic datasets through parallelization.
Methodology:
Parses and filters VCF files and SNPs, executes replicated ADMIXTURE (maximum-likelihood) runs, infers optimal K from replicates, and summarizes results with CLUMPAK while supporting parallel execution.
Topics
Details
- License:
- GPL-3.0
- Tool Type:
- command-line tool, workflow
- Operating Systems:
- Mac, Linux
- Programming Languages:
- Python, Shell
- Added:
- 1/18/2021
- Last Updated:
- 1/21/2021
Operations
Publications
Mussmann SM, Douglas MR, Chafin TK, Douglas ME. AdmixPipe: population analyses in Admixture for non-model organisms. BMC Bioinformatics. 2020;21(1). doi:10.1186/s12859-020-03701-4. PMID:32727359. PMCID:PMC7391514.