RNASeqPowerCalculator
RNASeqPowerCalculator estimates statistical power and optimizes RNA-Seq experimental design for detecting differential expression under a negative binomial model.
Key Features:
- Power Analysis: Simulates RNA-Seq count data using parameters derived from six diverse public datasets, including cell line comparisons, tissue comparisons, and cancer studies, to calculate statistical power for paired and unpaired experiments.
- Performance Evaluation: Evaluates five differential expression analysis packages (DESeq, edgeR, DESeq2, sSeq, EBSeq) across metrics including statistical power, ROC curves, AUC, Matthews correlation coefficient (MCC), and F-measures, identifying DESeq2 and edgeR as generally superior.
- Sample Size vs. Sequencing Depth: Quantifies how increasing sample size or sequencing depth affects power and shows that increasing sample size is more effective than increasing sequencing depth, particularly when depth exceeds 20 million reads.
- Gene Type Considerations: Reports that long intergenic noncoding RNAs (lincRNAs) have lower power than protein-coding mRNAs due to generally lower expression levels in RNA-Seq experiments.
- Paired-Sample Design: Demonstrates significant enhancement of statistical power through paired-sample RNA-Seq and emphasizes multifactor experimental design considerations.
- Budget-Constrained Optimization: Identifies a local optimal power achievable within budget constraints and shows sample size is the dominant factor influencing optimization relative to sequencing depth.
Scientific Applications:
- Experimental design planning: Guides allocation of sample size and sequencing depth for differential expression studies involving cell lines, tissues, and cancer types under budget constraints.
- Method benchmarking: Supports selection and comparison of differential expression analysis packages using power, ROC/AUC, MCC, and F-measure metrics.
- Design for specific targets: Informs design choices for studies targeting lincRNAs versus protein-coding mRNAs and for paired or multifactor experimental designs.
Methodology:
Simulates RNA-Seq count data under a negative binomial model using parameters from six public datasets and evaluates differential expression methods (DESeq, edgeR, DESeq2, sSeq, EBSeq) by computing statistical power, ROC curves, AUC, MCC, and F-measures.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Windows
- Programming Languages:
- R
- Added:
- 8/3/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Ching T, Huang S, Garmire LX. Power analysis and sample size estimation for RNA-Seq differential expression. RNA. 2014;20(11):1684-1696. doi:10.1261/rna.046011.114. PMID:25246651. PMCID:PMC4201821.
PMID: 25246651
PMCID: PMC4201821
Funding: - National Institute of General Medical Sciences: P20 COBRE GM103457