RNA-Skim
RNA-Skim quantifies transcript-level abundances from RNA-Seq data using an alignment-free, sig-mer (k-mer) based approach.
Key Features:
- Alignment-Free Methodology: Eliminates read-to-reference alignment by using k-mer signatures for quantification.
- Partitioning into Transcript Clusters: Partitions the transcriptome into disjoint clusters based on sequence similarity.
- Introduction of Sig-Mers: Defines sig-mers as specific k-mers uniquely associated with each transcript cluster and uses them to inform abundance estimates.
- Parallel Processing Capability: Reduces the complex optimization problem into smaller tasks that can be processed independently and in parallel.
- Resource Efficiency: Operates using a small subset of k-mers (reported as <4%) and under 10% of the CPU time required by Sailfish, with >100-fold speedup versus alignment-based methods.
- High Accuracy: Produces transcript quantifications reported as comparable to or better than existing methods.
Scientific Applications:
- Gene expression studies: Provides transcript-level abundance estimates for analyses of gene expression.
- Differential expression analysis: Supplies quantifications suitable for comparing transcript abundances across conditions.
- Large-scale transcriptome profiling: Enables high-throughput transcript quantification for large datasets.
Methodology:
Partition the transcriptome into disjoint clusters by sequence similarity, identify sig-mers (specific k-mers uniquely associated with each cluster), use sig-mers to estimate transcript abundances within clusters while eliminating alignment, and reduce the optimization into smaller independent parallel tasks using a small subset of k-mers (reported <4%).
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Mac
- Programming Languages:
- C++
- Added:
- 8/3/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Zhang Z, Wang W. RNA-Skim: a rapid method for RNA-Seq quantification at transcript level. Bioinformatics. 2014;30(12):i283-i292. doi:10.1093/bioinformatics/btu288. PMID:24931995. PMCID:PMC4058932.
Documentation
General
http://www.csbio.unc.edu/rs/