SAIGEgds
SAIGEgds is a high-performance statistical R package designed for large-scale Phenome-wide Association Studies (PheWAS). PheWAS is a powerful tool for discovering and replicating genetic associations across a wide range of phenotypes. To address computational challenges in analyzing large cohorts, such as the UK Biobank, SAIGE was introduced to handle case-control imbalance and sample relatedness efficiently. However, SAIGE remains computationally intensive, especially when dealing with thousands of ICD10-coded phenotypes and whole-genome imputed genotype data.
SAIGEgds optimizes the SAIGE method by implementing it in C++ codes and incorporating sparse genotype dosages and an efficient genomic data structure file format. This results in a 5-6 times faster analysis compared to the original SAIGE R package.
Topic
Biobank;Genotype and phenotype;GWAS study;DNA polymorphism
Detail
Operation: Imputation;Genotyping;Regression analysis
Software interface: Command-line user interface
Language: R,C++
License: The GNU General Public License v3.0
Cost: Free
Version name: 2.2.0
Credit: AbbVie.
Input: -
Output: -
Contact: Xiuwen Zheng xiuwen.zheng@abbvie.com
Collection: -
Maturity: Mature
Publications
- SAIGEgds-an efficient statistical tool for large-scale PheWAS with mixed models.
- Zheng X and Davis JW. SAIGEgds-an efficient statistical tool for large-scale PheWAS with mixed models. SAIGEgds-an efficient statistical tool for large-scale PheWAS with mixed models. 2021; 37:728-730. doi: 10.1093/bioinformatics/btaa731
- https://doi.org/10.1093/BIOINFORMATICS/BTAA731
- PMID: 32898220
- PMC: -
Download and documentation
Source: https://bioconductor.org/packages/release/bioc/src/contrib/SAIGEgds_2.2.0.tar.gz
Documentation: https://bioconductor.org/packages/release/bioc/manuals/SAIGEgds/man/SAIGEgds.pdf
Home page: https://bioconductor.org/packages/SAIGEgds
Links: https://bioconductor.org/packages/release/bioc/vignettes/SAIGEgds/inst/doc/SAIGEgds.R
< Back to DB search