GSAR

"GSAR" (Gene Set Analysis in R) introduces an innovative approach to analyzing differentially expressed gene sets in gene expression data, focusing on the use of minimum-spanning tree (MST)-based non-parametric multivariate tests. While a wide range of tests are available for gene set analysis, including aggregation tests and multivariate tests that consider intergene correlations, many of these tests lack specificity in identifying the exact nature of the deviation from the null hypothesis upon rejection.

The core contribution of GSAR is its comparison of the power and Type I error rates of MST-based tests with other commonly used tests in pathway analyses. Through simulation studies, GSAR demonstrates that MST-based tests have strength comparable to conventional tests in many scenarios and offer superior performance in specific parameter spaces that are biologically relevant. These tests are particularly effective in distinguishing between shift and scale alternatives, a critical aspect of analyzing gene expression data.

GSAR proposes a two-step practical analysis strategy to enhance the interpretability of experimental data. The first step involves applying a powerful multivariate test to identify gene sets where the null hypothesis is rejected. The second step uses MST-based tests on these identified pathways to pinpoint those supporting specific alternative hypotheses. This approach not only refines the analysis process but also clarifies the underlying biological phenomena.

Topic

Statistics and probability;RNA-seq;Microarray experiment;Gene expression

Detail

  • Operation: Differential gene expression analysis;Gene expression correlation

  • Software interface: Command-line user interface,Library

  • Language: R

  • License: GNU General Public License, version 2

  • Cost: Free

  • Version name: 1.36.0

  • Credit: The Arkansas Biosciences Institute, the major research component of the Arkansas Tobacco Settlement Proceeds Act of 2000, the Arkansas Translational Research Institute, NIH, The, EPSRC.

  • Input: Gene expression profile [Textual format]

  • Output: P-value [Textual format] [Image format], Plot [Textual format] [Image format], Pathway or network [Textual format] [Image format], Report [Textual format] [Image format]

  • Contact: Yasir Rahmatallah yrahm@allah@uams.edu,GalinaGlazko

  • Collection: -

  • Maturity: Stable

Publications

  • Gene set analysis for self-contained tests: complex null and specific alternative hypotheses.
  • Rahmatallah Y, et al. Gene set analysis for self-contained tests: complex null and specific alternative hypotheses. Gene set analysis for self-contained tests: complex null and specific alternative hypotheses. 2012; 28:3073-80. doi: 10.1093/bioinformatics/bts579
  • https://doi.org/10.1093/bioinformatics/bts579
  • PMID: 23044539
  • PMC: PMC3509490
  • GSAR: Bioconductor package for Gene Set analysis in R.
  • Rahmatallah Y, et al. GSAR: Bioconductor package for Gene Set analysis in R. GSAR: Bioconductor package for Gene Set analysis in R. 2017; 18:61. doi: 10.1186/s12859-017-1482-6
  • https://doi.org/10.1186/s12859-017-1482-6
  • PMID: 28118818
  • PMC: PMC5259853

Download and documentation


< Back to DB search