CellBender
CellBender removes background ambient RNA and technical noise from droplet-based single-cell assays (scRNA-seq, snRNA-seq, CITE-seq) by applying an unsupervised deep generative model to produce denoised gene expression counts for improved downstream analysis.
Key Features:
- Deep generative model: A model structured to reflect the phenomenology of background noise generation in droplet-based assays.
- Unsupervised droplet classification: Distinguishes cell-containing and cell-free droplets without prior labeling.
- Background noise modeling: Learns the profile of ambient RNA and off-target gene expression across droplets.
- End-to-end denoising: Retrieves noise-free quantification of gene expression through an end-to-end process.
- Supported modalities: Explicit support for droplet-based scRNA-seq, snRNA-seq, and CITE-seq data.
- Performance on simulations: Performance approaches the theoretically optimal denoising limit on simulated datasets.
- Scalability and robustness: Implemented to scale to large droplet-based datasets while maintaining robust denoising.
- Biological insight: The learned background profile can provide insight into degraded or uncaptured cell types.
- Reduction of technical artifacts: Mitigates batch effects and spurious differential gene expression signals caused by ambient/background counts.
Scientific Applications:
- Ambient RNA removal: Removing ambient and technical background from droplet-based scRNA-seq, snRNA-seq, and CITE-seq datasets.
- Improved differential expression: Reducing spurious differential gene expression results caused by background counts.
- Batch-effect mitigation: Decreasing batch effects arising from inconsistent ambient contamination across experiments.
- Data alignment across modalities: Enhancing alignment of droplet-based single-cell data with established gene expression patterns across tissues, protocols, and modalities.
- Detection of degraded/uncaptured cells: Using background profiles to infer presence of degraded or uncaptured cell types.
Methodology:
An unsupervised deep generative model structured to reflect background noise generation is trained to distinguish cell-containing from cell-free droplets, learn a background noise profile, and output denoised gene expression counts via an end-to-end process.
Topics
Details
- License:
- BSD-3-Clause
- Programming Languages:
- Python
- Added:
- 1/9/2020
- Last Updated:
- 12/10/2020
Operations
Publications
Fleming SJ, Chaffin MD, Arduini A, Akkad A, Banks E, Marioni JC, Philippakis AA, Ellinor PT, Babadi M. Unsupervised removal of systematic background noise from droplet-based single-cell experiments using <tt>CellBender</tt>. Unknown Journal. 2019. doi:10.1101/791699.