CoCo
CoCo reassigns RNA-seq reads to account for nested genes and multi-mapped reads, improving transcript quantification in higher eukaryotic transcriptomes.
Key Features:
- Nested-gene-aware read assignment: Implements a read assignment pipeline that accounts for nested genes and overlapping annotations.
- Modified annotation: Uses a modified annotation file specifically designed to highlight nested genes.
- Multi-mapped read allocation: Proportionally allocates multi-mapped reads among repeated sequences.
- Read recovery: Recovers over 15% of previously discarded aligned RNA-seq reads.
- Impact on quantification: Alters abundance estimates for both coding and non-coding RNAs by redistributing reads.
- Validation: Validation reported using PCR and bedgraph comparisons.
Scientific Applications:
- Transcript quantification: Improves quantification accuracy in RNA-seq experiments from higher eukaryotic transcriptomes with nested and repetitive genes.
- Gene expression studies: Enhances reliability of coding and non-coding RNA abundance estimates for gene expression analyses.
- Next-generation sequencing analyses: Applies to RNA-seq datasets generated by next-generation sequencing technologies where multi-mapped reads and nested genes affect read assignment.
Methodology:
A read assignment pipeline uses a modified annotation file to mark nested genes and proportionally allocates multi-mapped reads among repeated sequences.
Topics
Details
- License:
- GPL-3.0
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- Python
- Added:
- 8/9/2019
- Last Updated:
- 11/24/2024
Operations
Publications
Deschamps-Francoeur G, Boivin V, Abou Elela S, Scott MS. CoCo: RNA-seq read assignment correction for nested genes and multimapped reads. Bioinformatics. 2019;35(23):5039-5047. doi:10.1093/bioinformatics/btz433. PMID:31141144. PMCID:PMC6901076.