GSEABenchmarkeR

GSEABenchmarkeR provides a reproducible and extensible framework for benchmarking gene set enrichment and network-based enrichment methods using large-scale microarray and RNA-seq expression compendia.


Key Features:

  • R/Bioconductor package: Implemented as an R/Bioconductor package for use within Bioconductor workflows.
  • Benchmark scope: Benchmarks gene set enrichment and network-based enrichment methodologies.
  • Input data types: Operates on large-scale expression compendia derived from microarray and RNA-seq studies, including curated phenotype-rich expression collections.
  • Scalability and parallel execution: Supports efficient parallel execution on local workstations, high-performance computing environments, and institutional compute grids to scale to hundreds or thousands of datasets.
  • Evaluation metrics: Quantitatively assesses computational runtime, detection of statistically significant gene sets, and biological relevance of enriched pathways with respect to phenotypes.
  • Extensibility: Enables incorporation of new gene set enrichment or network-based methods and definition of custom evaluation metrics.
  • Standardized inputs and outputs: Provides uniform input handling, standardized execution pipelines, and structured result objects.

Scientific Applications:

  • Method comparison: Compare competing gene set and network-based enrichment algorithms in a controlled and standardized manner.
  • Performance evaluation: Quantitatively evaluate algorithm performance across hundreds or thousands of microarray and RNA-seq datasets.
  • Phenotype association: Assess the biological relevance of enriched pathways relative to phenotypes using curated phenotype-rich collections.
  • Method development: Benchmark domain-specific algorithms and guide methodological improvements through standardized evaluation.

Methodology:

Systematic evaluation of enrichment methods using large-scale expression compendia from microarray and RNA-seq studies; efficient parallel execution on local, high-performance, and grid computing environments; standardized execution pipelines with uniform input handling and structured result objects; computation of metrics including runtime, detection of statistically significant gene sets, and biological relevance of enriched pathways.

Topics

Collections

Details

License:
Artistic-2.0
Tool Type:
library
Operating Systems:
Linux, Windows, Mac
Programming Languages:
R
Added:
7/9/2018
Last Updated:
12/10/2018

Operations

Documentation

Downloads

Links