svapls

svapls adjusts for hidden sources of variability in gene expression studies by extracting sample-specific heterogeneity signals to improve the accuracy of differential expression testing.


Key Features:

  • Identification of Hidden Sources of Heterogeneity: Identifies unknown sample-specific sources of heterogeneity, including subject-specific effects and residual variability, that can confound gene expression measurements.
  • Partial Least Squares Regression: Uses Partial Least Squares (PLS) regression to extract hidden signals associated with sample-specific heterogeneity.
  • Improved Detection Power: Demonstrates improved detection power and reduced error rates through sensitivity analyses on simulated data with diverse patterns of hidden variation.
  • Application to Real Datasets and Batch Effect Detection: Validates performance on real-life datasets such as the Golub dataset and identifies potential batch effects in gene expression data.

Scientific Applications:

  • Differential Gene Expression Testing: Improves the accuracy of differential gene expression testing by adjusting for hidden variability.
  • Genome-wide Testing and Transcript Discovery: Supports genome-wide testing to identify transcripts associated with phenotypes by reducing confounding from sample-specific heterogeneity.
  • Batch Effect Detection: Detects potential batch effects in gene expression datasets to inform downstream analyses.

Methodology:

Identifies unknown sample-specific heterogeneity, extracts hidden signals using Partial Least Squares (PLS) regression, performs sensitivity analyses on simulated datasets, and validates findings on the Golub dataset.

Topics

Details

Tool Type:
library
Operating Systems:
Linux, Windows, Mac
Programming Languages:
R
Added:
5/21/2018
Last Updated:
12/10/2018

Operations

Publications

Chakraborty S, Datta S, Datta S. svapls: an R package to correct for hidden factors of variability in gene expression studies. BMC Bioinformatics. 2013;14(1). doi:10.1186/1471-2105-14-236. PMID:23883280. PMCID:PMC3733742.

Documentation