edgeR
edgeR performs differential expression analysis of RNA-seq and other genomic count-based data (e.g., ChIP-seq, SAGE, CAGE) by using negative binomial models to estimate gene-wise variability and test for expression changes across complex experimental designs.
Key Features:
- Statistical Framework: Implements negative binomial models that accommodate complex experimental designs with multiple treatment conditions and blocking variables to model biological and technical variation.
- Empirical Bayes Methods: Uses empirical Bayes approaches to estimate and moderate gene-specific dispersions, improving variability estimates with limited biological replicates.
- Generalized Linear Models (GLMs): Supports GLM-based testing with genewise dispersions for paired designs and identification of condition- or tumour-specific changes.
- Handling Overdispersion: Models overdispersion via an overdispersed Poisson/negative binomial approach with empirical Bayes moderation of dispersion estimates.
- Parallel Computational Approaches: Applies parallel computation to facilitate non-linear model fitting and to scale GLM applications to genomic datasets.
- Simulations and Estimators: Uses simulations to assess adjusted profile likelihood estimators and provides empirical Bayes estimators that balance common-dispersion and genewise-dispersion assumptions.
Scientific Applications:
- RNA-seq Analysis: Identifies differentially expressed genes and pathways from RNA-seq count data.
- ChIP-seq and Other Genomic Data: Applies count-based statistical methods to ChIP-seq, SAGE, CAGE and other genomic count data types, including analyses related to epigenetic marks and DNA methylation.
- Pathway Analysis: Facilitates downstream pathway-level analyses and visualization of differential expression results.
Methodology:
Integrates with R and Bioconductor, accepts counts from read alignment (e.g., Rsubread) and count quantification, and performs differential expression analysis using quasi-likelihood methods based on negative binomial models.
Topics
Collections
Details
- License:
- GPL-2.0
- Tool Type:
- command-line tool, library
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- R
- Added:
- 1/17/2017
- Last Updated:
- 6/21/2022
Operations
Publications
Robinson MD, McCarthy DJ, Smyth GK. <tt>edgeR</tt> : a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2009;26(1):139-140. doi:10.1093/bioinformatics/btp616. PMID:19910308. PMCID:PMC2796818.
Chen Y, Lun ATL, Smyth GK. From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline. F1000Research. 2016;5:1438. doi:10.12688/f1000research.8987.2. PMID:27508061. PMCID:PMC4934518.
McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012;40(10):4288-4297. doi:10.1093/nar/gks042. PMID:22287627. PMCID:PMC3378882.