SNP2HLA

SNP2HLA imputes amino acid polymorphisms and single nucleotide polymorphisms (SNPs) within human leukocyte antigen (HLA) genes in the major histocompatibility complex (MHC) to enable HLA variant interrogation from genome-wide association study (GWAS) datasets.


Key Features:

  • Imputation strategy: Leverages long-range linkage disequilibrium between HLA loci and SNP markers across the MHC to impute classical HLA alleles and amino acid polymorphisms.
  • Target loci: Supports class I loci HLA-A, -B, -C and class II loci HLA-DPA1, -DPB1, -DQA1, -DQB1, and -DRB1.
  • Reference panels: Evaluated using two European-ancestry panels: HapMap-CEPH pedigrees (n=90) and the Type 1 Diabetes Genetics Consortium (T1DGC, n=5,225).
  • Resolution: Imputes classical alleles at four-digit resolution and amino acid polymorphisms within HLA genes.
  • Imputation accuracy: With the T1DGC panel, average allele imputation accuracy is 94.7% using the Affymetrix GeneChip 500K and 96.7% using the Illumina Immunochip; amino acid polymorphism accuracies reach 98.6% and 99.3% on these platforms, respectively.
  • Reference panel importance: Demonstrates that reference panel size is more critical than SNP density of the genotyping platform for achieving high imputation accuracy.
  • GWAS compatibility: Enables interrogation of HLA variation from existing GWAS datasets and was evaluated on Affymetrix GeneChip 500K and Illumina Immunochip data.
  • Application in genetic studies: Facilitates association testing and fine-mapping at amino acid resolution within the MHC for disease genetics research.

Scientific Applications:

  • Fine-mapping MHC signals: Enables fine-mapping of primary MHC association signals at the allele and amino acid level.
  • Association testing at amino acid resolution: Supports association analyses of amino acid polymorphisms within HLA genes.
  • Disease genetics: Applied to study genetic susceptibility to diseases mediated by HLA variation, including type 1 diabetes.
  • Reuse of GWAS data: Allows HLA variant interrogation using existing GWAS cohorts genotyped on platforms such as Affymetrix GeneChip 500K and Illumina Immunochip.

Methodology:

Imputation uses long-range linkage disequilibrium between HLA loci and SNP markers across the MHC with reference panels (HapMap-CEPH n=90; T1DGC n=5,225) to impute classical four-digit HLA alleles and amino acid polymorphisms at class I and class II loci, with accuracy evaluated on Affymetrix GeneChip 500K and Illumina Immunochip.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
Java
Added:
12/18/2017
Last Updated:
11/25/2024

Operations

Publications

Jia X, Han B, Onengut-Gumuscu S, Chen W, Concannon PJ, Rich SS, Raychaudhuri S, de Bakker PI. Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens. PLoS ONE. 2013;8(6):e64683. doi:10.1371/journal.pone.0064683. PMID:23762245. PMCID:PMC3675122.

Documentation

Links