SubPatCNV
SubPatCNV: Approximate Detection of Subgroup-Specific Copy-Number Variations
SubPatCNV implements an approximate subspace pattern mining algorithm to identify aberrant copy-number variation (CNV) regions specific to arbitrary sample subgroups in high-density array data, using spatial constraints and user-defined support thresholds.
Key Features:
- Approximate Subspace Pattern Mining: Detects consistent CNV subspace patterns within sample subsets using approximate association pattern mining.
- Spatial Constraints on CNV Probes: Applies positional constraints to CNV probe features to identify nearly identical deletions or insertions across subgroups.
- Support Threshold Filtering: Identifies aberrant CNV regions specific to subgroups exceeding a predefined minimum support.
- Scalability: Processes high-density array datasets and adapts to variable subgroup sizes.
Scientific Applications:
- Population-Specific Germline CNVs: Identified germline CNVs in HapMap samples across four distinct populations.
- Cancer Genomics: Detected large aberrant CNV events in TCGA ovarian cancer patient subgroups, highlighting regions enriched with cancer-related genes.
Methodology:
The method applies approximate association pattern mining under spatial constraints to high-density array-derived CNV probe data. It searches for nearly identical deletion or insertion patterns within subgroups that satisfy a minimum support threshold, enabling detection of subgroup-specific CNV regions across heterogeneous populations.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Mac
- Programming Languages:
- MATLAB, C++
- Added:
- 5/22/2018
- Last Updated:
- 12/10/2018
Operations
Publications
Johnson N, Zhang H, Fang G, Kumar V, Kuang R. SubPatCNV: approximate subspace pattern mining for mapping copy-number variations. BMC Bioinformatics. 2015;16(1). doi:10.1186/s12859-014-0426-7. PMID:25591662. PMCID:PMC4305219.