scGEAToolbox
scGEAToolbox provides computational analysis of single-cell RNA sequencing (scRNA-seq) data to quantify cell heterogeneity, gene expression variability, trajectories/pseudotime, and gene regulatory networks.
Key Features:
- Data Normalization: Functions to normalize scRNA-seq count data for comparable metrics across samples and batches.
- Feature Selection: Methods to select informative genes/features for dimensionality reduction and downstream analysis.
- Batch Correction: Procedures to mitigate batch effects that confound biological variation.
- Imputation: Techniques to impute missing or zero-inflated values in sparse scRNA-seq data.
- Cell Clustering: Algorithms to identify distinct cell populations from single-cell expression profiles.
- Trajectory/Pseudotime Analysis: Tools for inferring developmental trajectories and ordering cells along pseudotime.
- Network Construction: Functions to construct and analyze gene regulatory networks from single-cell data.
Scientific Applications:
- Cell heterogeneity analysis: Characterizing cellular diversity and gene expression variability within heterogeneous samples using scRNA-seq data.
- Developmental trajectory and pseudotime inference: Studying dynamic biological processes and developmental pathways by ordering cells in pseudotime.
- Gene regulatory network reconstruction: Inferring regulatory interactions and network structure among genes from single-cell expression profiles.
- Preprocessing for downstream analyses: Enabling normalization, batch correction, and imputation to improve the reliability of subsequent analyses.
Methodology:
Functions are implemented in native MATLAB, with wrapper functions to interface with third-party tools developed in MATLAB or other programming languages.
Topics
Details
- License:
- GPL-3.0
- Programming Languages:
- MATLAB
- Added:
- 1/14/2020
- Last Updated:
- 12/17/2020
Operations
Publications
Cai JJ. scGEAToolbox: a Matlab toolbox for single-cell RNA sequencing data analysis. Bioinformatics. 2019;36(6):1948-1949. doi:10.1093/bioinformatics/btz830. PMID:31697351.
PMID: 31697351
Funding: - Texas A&M University T3 grant and National Institutes of Health: R21AI126219