edgeR

edgeR performs differential expression analysis of RNA-seq and other genomic count-based data (e.g., ChIP-seq, SAGE, CAGE) by using negative binomial models to estimate gene-wise variability and test for expression changes across complex experimental designs.


Key Features:

  • Statistical Framework: Implements negative binomial models that accommodate complex experimental designs with multiple treatment conditions and blocking variables to model biological and technical variation.
  • Empirical Bayes Methods: Uses empirical Bayes approaches to estimate and moderate gene-specific dispersions, improving variability estimates with limited biological replicates.
  • Generalized Linear Models (GLMs): Supports GLM-based testing with genewise dispersions for paired designs and identification of condition- or tumour-specific changes.
  • Handling Overdispersion: Models overdispersion via an overdispersed Poisson/negative binomial approach with empirical Bayes moderation of dispersion estimates.
  • Parallel Computational Approaches: Applies parallel computation to facilitate non-linear model fitting and to scale GLM applications to genomic datasets.
  • Simulations and Estimators: Uses simulations to assess adjusted profile likelihood estimators and provides empirical Bayes estimators that balance common-dispersion and genewise-dispersion assumptions.

Scientific Applications:

  • RNA-seq Analysis: Identifies differentially expressed genes and pathways from RNA-seq count data.
  • ChIP-seq and Other Genomic Data: Applies count-based statistical methods to ChIP-seq, SAGE, CAGE and other genomic count data types, including analyses related to epigenetic marks and DNA methylation.
  • Pathway Analysis: Facilitates downstream pathway-level analyses and visualization of differential expression results.

Methodology:

Integrates with R and Bioconductor, accepts counts from read alignment (e.g., Rsubread) and count quantification, and performs differential expression analysis using quasi-likelihood methods based on negative binomial models.

Topics

Collections

Details

License:
GPL-2.0
Tool Type:
command-line tool, library
Operating Systems:
Linux, Windows, Mac
Programming Languages:
R
Added:
1/17/2017
Last Updated:
6/21/2022

Operations

Publications

Robinson MD, McCarthy DJ, Smyth GK. <tt>edgeR</tt> : a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2009;26(1):139-140. doi:10.1093/bioinformatics/btp616. PMID:19910308. PMCID:PMC2796818.

Chen Y, Lun ATL, Smyth GK. From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline. F1000Research. 2016;5:1438. doi:10.12688/f1000research.8987.2. PMID:27508061. PMCID:PMC4934518.

McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012;40(10):4288-4297. doi:10.1093/nar/gks042. PMID:22287627. PMCID:PMC3378882.

Documentation

Downloads

Links