PlatinumCNV

PlatinumCNV infers allele-specific copy number genotypes genome-wide from SNP array fluorescent signal intensity data using a Bayesian Gaussian mixture model to analyze allele-specific copy number polymorphisms (CNPs).


Key Features:

  • Bayesian Gaussian Mixture Model: Uses a Bayesian GMM to cluster fluorescent signal intensities from SNP arrays into allele-specific copy number genotype clusters.
  • Overlap and Error Handling: Allows overlapping cluster memberships to accommodate instrumental errors, absolute scale differences, and X/Y scale imbalances in signal intensities.
  • Robustness to Unobserved Genotypes: Remains robust when some allele-specific copy number genotypes are unobserved at a locus.
  • Applied to Illumina HumanHap 610K: Has been applied to commercial SNP array intensity data from the Illumina HumanHap 610K platform.
  • Discovery of CNPs: Identified over 4,000 allele-specific CNPs, with most showing simple patterns characterized by a single aneuploid haplotype among normal haplotypes.
  • Validated Genotyping Accuracy: Achieved mean genotyping error rates as low as 1% validated by quantitative PCR and replicated subjects.
  • Foundation for GWAS: Facilitates genome-wide mapping of CNPs to support statistical analyses for disease gene and quantitative trait locus identification.

Scientific Applications:

  • Population and genetic diversity studies: Characterize allele-specific copy number polymorphism distributions across samples and populations.
  • Disease gene mapping: Map allele-specific CNPs that may be associated with disease genes.
  • Quantitative trait locus (QTL) analysis: Identify QTLs linked to allele-specific copy number variation.
  • Genome-wide association studies (GWAS): Provide allele-specific CNP genotypes and loci for inclusion in GWAS of complex traits.

Methodology:

Clusters SNP-array fluorescent signal intensities using a Bayesian Gaussian mixture model into clouds corresponding to allele-specific copy number genotypes, allowing overlapping cluster memberships to accommodate absolute scale differences and X/Y scale imbalances and to remain robust to unobserved genotypes.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
R, C
Added:
8/3/2017
Last Updated:
11/25/2024

Operations

Publications

Kumasaka N, Fujisawa H, Hosono N, Okada Y, Takahashi A, Nakamura Y, Kubo M, Kamatani N. PlatinumCNV: A Bayesian Gaussian mixture model for genotyping copy number polymorphisms using SNP array signal intensity data. Genetic Epidemiology. 2011;35(8):831-844. doi:10.1002/gepi.20633. PMID:22125222.

Documentation

Links