SCALE

SCALE extracts interpretable latent features from single-cell ATAC-seq (scATAC-seq) chromatin accessibility data to enable denoising, clustering, visualization, and identification of batch effects.


Key Features:

  • Deep Generative Framework: Employs a deep generative framework to model high-dimensional, sparse scATAC-seq signals and extract latent features representing chromatin accessibility.
  • Probabilistic Gaussian Mixture Model (GMM): Integrates a probabilistic Gaussian mixture model to characterize latent-feature distributions and support clustering and denoising.
  • Interpretable Features: Produces interpretable latent features that correlate with specific cell populations and chromatin accessibility patterns.
  • Batch Effect Identification: Detects and highlights batch effects across scATAC-seq experiments.

Scientific Applications:

  • Visualization: Provides low-dimensional embeddings for visualizing chromatin accessibility landscapes from scATAC-seq data.
  • Clustering: Enables identification of distinct cell populations via latent-space clustering of scATAC-seq profiles.
  • Denoising and Imputation: Performs denoising and imputes missing accessibility signals to improve downstream analyses of scATAC-seq datasets.

Methodology:

Implemented in PyTorch and based on a deep generative framework combined with a probabilistic Gaussian mixture model for latent feature learning.

Topics

Details

License:
MIT
Programming Languages:
Python
Added:
1/9/2020
Last Updated:
12/17/2020

Operations

Publications

Xiong L, Xu K, Tian K, Shao Y, Tang L, Gao G, Zhang M, Jiang T, Zhang QC. SCALE method for single-cell ATAC-seq analysis via latent feature extraction. Nature Communications. 2019;10(1). doi:10.1038/s41467-019-12630-7. PMID:31594952. PMCID:PMC6783552.

PMID: 31594952
PMCID: PMC6783552
Funding: - Ministry of Science and Technology of the People's Republic of China: 2018YFA0107603 - National Natural Science Foundation of China: 31621063, 31761163007, 91740204