MCbiclust

MCbiclust identifies large correlated biclusters in gene expression and transcriptomics datasets to uncover co-regulated gene networks within biologically heterogeneous data.


Key Features:

  • Novel Biclustering Algorithm: Selects both genes and samples from large datasets to detect large biclusters by maximizing correlation in gene expression.
  • Handling Biological Heterogeneity: Manages biologically heterogeneous transcriptomics collections to reveal co-regulated gene modules across diverse samples.
  • Biological Relevance: Validated on synthetic datasets and applied to large bacterial and cancer cell datasets, yielding biclusters with statistical significance and biological relevance.
  • Applications in Disease Diagnosis and Network Analysis: Enables identification of transcriptomics-based diagnostic indicators and downstream network analyses to investigate genotype-phenotype correlations.

Scientific Applications:

  • Co-regulated Gene Network Discovery: Detects large co-regulated gene networks and biclusters in gene expression datasets.
  • Multi-gene Controlled Processes: Identifies processes such as energy metabolism, organelle biogenesis, and stress responses that are controlled by multiple genes.
  • Disease-associated Transcriptomics: Facilitates discovery of transcriptomics features relevant to disease diagnosis and comparison across conditions.
  • Benchmarking and Validation: Supports benchmarking using synthetic datasets and application to large bacterial and cancer cell datasets.

Methodology:

Selects a supplied gene set of size n and identifies the maximum-strength correlation matrix containing m samples by selecting genes and samples that maximize correlation in gene expression.

Topics

Collections

Details

License:
GPL-2.0
Tool Type:
library
Operating Systems:
Linux, Windows, Mac
Programming Languages:
R
Added:
7/11/2018
Last Updated:
12/10/2018

Operations

Publications

Bentham RB, Bryson K, Szabadkai G. MCbiclust: a novel algorithm to discover large-scale functionally related gene sets from massive transcriptomics data collections. Nucleic Acids Research. 2017;45(15):8712-8730. doi:10.1093/nar/gkx590. PMID:28911113. PMCID:PMC5587796.

Documentation

Downloads

Links