zUMIs
zUMIs processes single-cell RNA sequencing (scRNA-seq) data using sample-specific barcodes (BCs) and unique molecular identifiers (UMIs) to produce accurate gene expression counts and mitigate amplification bias.
Key Features:
- Barcode handling: Processes both known and random sample-specific barcodes (BCs).
- UMI collapsing: Collapses UMIs to deduplicate reads, applicable to exon-only or combined exon and intron mapping reads.
- Exon and intron mapping support: Counts reads mapping to exons and optionally to introns to include pre-mRNA signal.
- Intact cell detection: Detects intact cells from the distribution of sequencing reads when BC annotation is missing.
- Adaptive downsampling: Downsamples libraries to manage varying library sizes and to assess whether sequencing has reached saturation.
- Amplification-bias mitigation: Uses UMI-based deduplication to reduce PCR amplification bias in count estimates.
Scientific Applications:
- Single-cell gene expression quantification: Generates accurate per-cell gene expression counts for scRNA-seq analyses.
- Improved gene detection and clustering: Inclusion of intronic reads and UMI handling increases the number of detected genes and can improve cluster resolution.
- Single-nucleus RNA-seq analysis: Applicable to single-nucleus datasets, where over 35% of reads may map to introns, thereby increasing detected genes.
- Library saturation assessment: Uses adaptive downsampling to evaluate sequencing depth and compare libraries of different sizes.
Methodology:
Processes known and random BCs and UMIs, collapses UMIs for exon or exon+intron mapping reads, detects intact cells from read distributions, and performs adaptive downsampling; supports data from scRNA-seq protocols that use BCs and UMIs.
Topics
Details
- License:
- GPL-3.0
- Tool Type:
- command-line tool
- Programming Languages:
- R, Shell, Perl
- Added:
- 7/12/2018
- Last Updated:
- 4/25/2021
Operations
Publications
Parekh S, Ziegenhain C, Vieth B, Enard W, Hellmann I. zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs. GigaScience. 2018;7(6). doi:10.1093/gigascience/giy059. PMID:29846586. PMCID:PMC6007394.