SAIGEgds

SAIGEgds is a high-performance statistical R package designed for large-scale Phenome-wide Association Studies (PheWAS). PheWAS is a powerful tool for discovering and replicating genetic associations across a wide range of phenotypes. To address computational challenges in analyzing large cohorts, such as the UK Biobank, SAIGE was introduced to handle case-control imbalance and sample relatedness efficiently. However, SAIGE remains computationally intensive, especially when dealing with thousands of ICD10-coded phenotypes and whole-genome imputed genotype data.

SAIGEgds optimizes the SAIGE method by implementing it in C++ codes and incorporating sparse genotype dosages and an efficient genomic data structure file format. This results in a 5-6 times faster analysis compared to the original SAIGE R package.

Topic

Biobank;Genotype and phenotype;GWAS study;DNA polymorphism

Detail

  • Operation: Imputation;Genotyping;Regression analysis

  • Software interface: Command-line user interface

  • Language: R,C++

  • License: The GNU General Public License v3.0

  • Cost: Free

  • Version name: 2.2.0

  • Credit: AbbVie.

  • Input: -

  • Output: -

  • Contact: Xiuwen Zheng xiuwen.zheng@abbvie.com

  • Collection: -

  • Maturity: Mature

Publications

  • SAIGEgds-an efficient statistical tool for large-scale PheWAS with mixed models.
  • Zheng X and Davis JW. SAIGEgds-an efficient statistical tool for large-scale PheWAS with mixed models. SAIGEgds-an efficient statistical tool for large-scale PheWAS with mixed models. 2021; 37:728-730. doi: 10.1093/bioinformatics/btaa731
  • https://doi.org/10.1093/BIOINFORMATICS/BTAA731
  • PMID: 32898220
  • PMC: -

Download and documentation


< Back to DB search