HIBAG

HIBAG imputes human leukocyte antigen (HLA) alleles from dense single-nucleotide polymorphism (SNP) genotype data to enable high-resolution HLA typing for genetic and immunogenetic studies by leveraging extended haplotype structure within the major histocompatibility complex (MHC).


Key Features:

  • Attribute bagging approach: Employs attribute bagging with bootstrap aggregating and random variable selection to construct an ensemble of classifiers and average HLA-type posterior probabilities for prediction.
  • Use of MHC haplotype structure: Leverages the extended haplotype structure within the MHC to predict HLA alleles from dense SNP genotypes.
  • Training parameter options: Can utilize published parameter estimates in place of individual-level training datasets.
  • Input data formats: Operates on dense SNP genotypes from genome-wide SNP panels, using SNP markers common across several Illumina platforms.
  • Performance assessment: Demonstrated prediction accuracies of 92.2%–98.1% for HLA-A, B, C, DRB1, and DQB1 using a 2668-subject European training set and independent validation in the British 1958 birth cohort (~1000 subjects).
  • Comparative evaluation: Performed competitively in comparative analyses against methods such as HLA*IMP and BEAGLE.
  • Implementation: Implemented as an R package and includes pre-fit classifiers for European, Asian, Hispanic, and African ancestries.

Scientific Applications:

  • Disease association studies: Imputes HLA alleles for association analyses of MHC-linked diseases and immune-related traits.
  • Adverse drug reaction research: Facilitates investigation of HLA-associated drug hypersensitivity and adverse drug reactions.
  • Population genetics: Enables large-scale population studies of HLA allele distributions across ancestries using SNP genotype datasets.
  • Genetic predisposition analyses: Supports studies of genetic predispositions and immunogenetic risk factors where direct HLA typing is impractical.

Methodology:

Constructs ensembles of classifiers via attribute bagging (bootstrap samples with random variable selection) and averages HLA-type posterior probabilities; can use published parameter estimates; validated using a 2668-subject European training set and independent validation in the British 1958 birth cohort (~1000 subjects) with SNP markers common across several Illumina platforms; implemented in an R package with pre-fit ancestry-specific classifiers.

Topics

Collections

Details

License:
GPL-3.0
Tool Type:
command-line tool, library
Operating Systems:
Windows, Mac
Programming Languages:
R
Added:
1/17/2017
Last Updated:
1/9/2019

Operations

Publications

Zheng X, Shen J, Cox C, Wakefield JC, Ehm MG, Nelson MR, Weir BS. HIBAG—HLA genotype imputation with attribute bagging. The Pharmacogenomics Journal. 2013;14(2):192-200. doi:10.1038/tpj.2013.18. PMID:23712092. PMCID:PMC3772955.

Documentation

Downloads

Links