GWAS catalog
GWAS Catalog aggregates curated results and metadata from published genome-wide association studies (GWAS), focusing on single-nucleotide polymorphism (SNP)–trait associations from studies assaying at least 100,000 SNPs and reporting associations with P < 1×10^-5.
Key Features:
- Curated Data Collection: As of September 2018, contains data from 5,687 GWAS, encompassing 71,673 variant–trait associations derived from 3,567 publications, including 11,912 SNPs and 1,751 curated publications.
- Summary statistics datasets: Provides 284 full P-value summary statistics datasets for genome-wide and targeted array studies representing over 600 million individual variant–trait statistics.
- Assay and technology scope: Includes data from genome-wide arrays, targeted arrays, exome arrays, and sequencing studies and incorporates approximately 1,000 new associations from targeted and exome array technologies.
- Structured ancestry and recruitment metadata: Records structured ancestry and recruitment information for all studies.
- Data formats and knowledge representation: Disseminates data in tab-delimited files and an OWL knowledge base.
- Summary statistics standardization: Supports a proposed community standard format for summary statistics to standardize heterogeneous submissions and harmonize content.
Scientific Applications:
- Causal variant identification: Enables identification and prioritization of candidate causal variants underlying trait associations.
- Disease mechanism elucidation: Facilitates investigation of genetic contributions to disease mechanisms through aggregated SNP–trait associations.
- Therapeutic target discovery: Supports discovery and validation of potential targets for novel therapies based on association signals.
- Population and study-design comparisons: Allows exploration of genetic associations across diverse populations, ancestry annotations, and different study designs and technologies.
Methodology:
Utilizes an OWL knowledge base and ontology-supported methods and performs summary-statistics standardization and harmonization of heterogeneous submissions.
Topics
Collections
Details
- License:
- Apache-2.0
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- api, web application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 7/4/2017
- Last Updated:
- 11/24/2024
Operations
Publications
Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, Klemm A, Flicek P, Manolio T, Hindorff L, Parkinson H. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Research. 2013;42(D1):D1001-D1006. doi:10.1093/nar/gkt1229. PMID:24316577. PMCID:PMC3965119.
MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, Junkins H, McMahon A, Milano A, Morales J, Pendlington ZM, Welter D, Burdett T, Hindorff L, Flicek P, Cunningham F, Parkinson H. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Research. 2016;45(D1):D896-D901. doi:10.1093/nar/gkw1133. PMID:27899670. PMCID:PMC5210590.
Buniello A, MacArthur JAL, Cerezo M, Harris LW, Hayhurst J, Malangone C, McMahon A, Morales J, Mountjoy E, Sollis E, Suveges D, Vrousgou O, Whetzel PL, Amode R, Guillen JA, Riat HS, Trevanion SJ, Hall P, Junkins H, Flicek P, Burdett T, Hindorff LA, Cunningham F, Parkinson H. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Research. 2018;47(D1):D1005-D1012. doi:10.1093/nar/gky1120. PMID:30445434. PMCID:PMC6323933.
Documentation
Downloads
- Downloads pagehttps://www.ebi.ac.uk/gwas/downloads