ADMIXPIPE

ADMIXPIPE automates preprocessing, SNP filtering, and result summarization for ADMIXTURE-based inference of population structure from VCF datasets, including reduced-representation sequencing data such as ddRAD.


Key Features:

  • VCF parsing and filtering: Parses Variant Call Format (VCF) files and applies filters to prepare genotype data for downstream analyses.
  • SNP filtering: Implements single nucleotide polymorphism (SNP) filtering to select loci appropriate for population-structure inference.
  • ADMIXTURE integration: Runs ADMIXTURE (maximum-likelihood framework) analyses, including replicated runs.
  • Optimal K inference: Infers the optimal number of population clusters (K) from replicated ADMIXTURE runs.
  • CLUMPAK integration: Summarizes and provides graphical representation of ADMIXTURE run variation via CLUMPAK integration.
  • Parallel processing: Supports parallel execution to improve efficiency on large population genomic datasets.
  • Support for reduced-representation data: Tailored to handle datasets generated by protocols such as double digest RAD (ddRAD).

Scientific Applications:

  • Population structure inference: Infers genetic population structure using ADMIXTURE-based analyses.
  • Analyses of non-model organisms: Processes genomic data from organisms lacking complete reference genomes, including ddRAD-derived datasets.
  • Admixture pattern characterization: Identifies and summarizes admixture patterns across replicated analyses to aid selection of K.
  • Genetic diversity and evolutionary relationships: Supports studies of genetic diversity and evolutionary relationships within and among populations.
  • Large-scale population genomics: Enables processing and analysis of large population genomic datasets through parallelization.

Methodology:

Parses and filters VCF files and SNPs, executes replicated ADMIXTURE (maximum-likelihood) runs, infers optimal K from replicates, and summarizes results with CLUMPAK while supporting parallel execution.

Topics

Details

License:
GPL-3.0
Tool Type:
command-line tool, workflow
Operating Systems:
Mac, Linux
Programming Languages:
Python, Shell
Added:
1/18/2021
Last Updated:
1/21/2021

Operations

Publications

Mussmann SM, Douglas MR, Chafin TK, Douglas ME. AdmixPipe: population analyses in Admixture for non-model organisms. BMC Bioinformatics. 2020;21(1). doi:10.1186/s12859-020-03701-4. PMID:32727359. PMCID:PMC7391514.