Molgenis-impute

Molgenis-impute performs genotype imputation and automates preprocessing, phasing, chunking, and imputation steps to support GWAS, meta-analysis, and fine-mapping of large-scale genotype datasets.


Key Features:

  • Automated pipeline setup: Automates setup and execution of imputation pipeline steps including liftover, phasing, quality control, chunking/merging, and imputation.
  • Genome build liftover and quality control: Performs genome build liftover and genotype quality control prior to phasing and imputation.
  • Genotype phasing (SHAPEIT2): Uses SHAPEIT2 for genotype phasing.
  • Imputation (IMPUTE2): Performs genotype imputation using IMPUTE2.
  • Sample and chromosomal chunking/merging: Supports chunking and merging by sample and chromosome to enable parallel processing.
  • Integration with MOLGENIS-compute: Submits and monitors computational tasks via MOLGENIS-compute for execution on HPC environments.
  • Flexibility and customization: Enables adaptation and integration with other workflows through the MOLGENIS-compute framework to add or modify computational steps.
  • Scalability and tested performance: Tested on PBS/SGE clusters, cloud VMs, and grid HPC systems, and applied to impute over 30,000 samples using the 1,000 Genomes Project and Genome of the Netherlands reference datasets.

Scientific Applications:

  • GWAS, meta-analysis, and fine-mapping: Provides imputed genotype data required for genome-wide association studies, meta-analyses, and fine-mapping of association signals.
  • Large-scale genotype cohort processing: Enables imputation of large genotype cohorts through parallelized chunking and HPC execution.
  • Reference-panel-based imputation: Supports use of reference panels such as the 1,000 Genomes Project and the Genome of the Netherlands for genotype imputation.

Methodology:

Performs genome build liftover, genotype quality control, phasing with SHAPEIT2, sample and chromosomal chunking/merging, and imputation with IMPUTE2, with task submission and monitoring via MOLGENIS-compute on HPC systems.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux, Mac
Programming Languages:
Java, Python
Added:
5/30/2018
Last Updated:
12/10/2018

Operations

Publications

Kanterakis A, Deelen P, van Dijk F, Byelas H, Dijkstra M, Swertz MA. Molgenis-impute: imputation pipeline in a box. BMC Research Notes. 2015;8(1). doi:10.1186/s13104-015-1309-3. PMID:26286716. PMCID:PMC4541731.

Documentation