MCbiclust
MCbiclust identifies large correlated biclusters in gene expression and transcriptomics datasets to uncover co-regulated gene networks within biologically heterogeneous data.
Key Features:
- Novel Biclustering Algorithm: Selects both genes and samples from large datasets to detect large biclusters by maximizing correlation in gene expression.
- Handling Biological Heterogeneity: Manages biologically heterogeneous transcriptomics collections to reveal co-regulated gene modules across diverse samples.
- Biological Relevance: Validated on synthetic datasets and applied to large bacterial and cancer cell datasets, yielding biclusters with statistical significance and biological relevance.
- Applications in Disease Diagnosis and Network Analysis: Enables identification of transcriptomics-based diagnostic indicators and downstream network analyses to investigate genotype-phenotype correlations.
Scientific Applications:
- Co-regulated Gene Network Discovery: Detects large co-regulated gene networks and biclusters in gene expression datasets.
- Multi-gene Controlled Processes: Identifies processes such as energy metabolism, organelle biogenesis, and stress responses that are controlled by multiple genes.
- Disease-associated Transcriptomics: Facilitates discovery of transcriptomics features relevant to disease diagnosis and comparison across conditions.
- Benchmarking and Validation: Supports benchmarking using synthetic datasets and application to large bacterial and cancer cell datasets.
Methodology:
Selects a supplied gene set of size n and identifies the maximum-strength correlation matrix containing m samples by selecting genes and samples that maximize correlation in gene expression.
Topics
Collections
Details
- License:
- GPL-2.0
- Tool Type:
- library
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- R
- Added:
- 7/11/2018
- Last Updated:
- 12/10/2018
Operations
Publications
Bentham RB, Bryson K, Szabadkai G. MCbiclust: a novel algorithm to discover large-scale functionally related gene sets from massive transcriptomics data collections. Nucleic Acids Research. 2017;45(15):8712-8730. doi:10.1093/nar/gkx590. PMID:28911113. PMCID:PMC5587796.