scGEAToolbox

scGEAToolbox provides computational analysis of single-cell RNA sequencing (scRNA-seq) data to quantify cell heterogeneity, gene expression variability, trajectories/pseudotime, and gene regulatory networks.


Key Features:

  • Data Normalization: Functions to normalize scRNA-seq count data for comparable metrics across samples and batches.
  • Feature Selection: Methods to select informative genes/features for dimensionality reduction and downstream analysis.
  • Batch Correction: Procedures to mitigate batch effects that confound biological variation.
  • Imputation: Techniques to impute missing or zero-inflated values in sparse scRNA-seq data.
  • Cell Clustering: Algorithms to identify distinct cell populations from single-cell expression profiles.
  • Trajectory/Pseudotime Analysis: Tools for inferring developmental trajectories and ordering cells along pseudotime.
  • Network Construction: Functions to construct and analyze gene regulatory networks from single-cell data.

Scientific Applications:

  • Cell heterogeneity analysis: Characterizing cellular diversity and gene expression variability within heterogeneous samples using scRNA-seq data.
  • Developmental trajectory and pseudotime inference: Studying dynamic biological processes and developmental pathways by ordering cells in pseudotime.
  • Gene regulatory network reconstruction: Inferring regulatory interactions and network structure among genes from single-cell expression profiles.
  • Preprocessing for downstream analyses: Enabling normalization, batch correction, and imputation to improve the reliability of subsequent analyses.

Methodology:

Functions are implemented in native MATLAB, with wrapper functions to interface with third-party tools developed in MATLAB or other programming languages.

Topics

Details

License:
GPL-3.0
Programming Languages:
MATLAB
Added:
1/14/2020
Last Updated:
12/17/2020

Operations

Publications

Cai JJ. scGEAToolbox: a Matlab toolbox for single-cell RNA sequencing data analysis. Bioinformatics. 2019;36(6):1948-1949. doi:10.1093/bioinformatics/btz830. PMID:31697351.

PMID: 31697351
Funding: - Texas A&M University T3 grant and National Institutes of Health: R21AI126219