Rsubread

Rsubread maps sequencing reads to reference genomes using a multi-seed "seed-and-vote" strategy to enable efficient, sensitive, and accurate alignment for genomic and transcriptomic analysis.


Key Features:

  • Seed-and-vote strategy: Extracts multiple short seeds (subreads) from each read and uses their votes to determine the overall genomic location for the read.
  • Subread extraction: Derives multiple subreads from each sequencing read to provide redundant mapping evidence.
  • Preselection for efficiency: Selects the candidate genomic region before detailed alignment to speed up mapping.
  • Mismatch and indel refinement: Applies conventional alignment algorithms to refine mismatches and indels within the preselected region.
  • Sensitivity and flexibility: Does not require individual subreads to map exactly or be in close proximity, allowing accommodation of mapping variation.
  • Consensus-based accuracy: Requires support from multiple subreads for a final mapped location, improving mapping accuracy.
  • Exon junction detection: Identifies exon-exon junctions by detecting reads whose subreads map to different exons within the same gene.
  • Scalability for long reads: Maintains performance and accuracy as read length increases.

Scientific Applications:

  • Quality assessment: Assesses sequence read quality to inform downstream analysis.
  • Read summarization: Summarizes mapped read data for interpretation of sequencing results.
  • Indel detection: Detects short and long insertions and deletions during alignment refinement.
  • Expression calling: Supports absolute expression quantification for transcriptomic studies.
  • SNP calling: Enables detection of single nucleotide polymorphisms from aligned reads.

Methodology:

Performs a two-step alignment: (1) uses the seed-and-vote strategy where subreads vote to determine the genomic location of reads, and (2) applies conventional alignment algorithms to refine mismatches and indels within the selected region.

Topics

Collections

Details

License:
GPL-3.0
Tool Type:
command-line tool, library
Operating Systems:
Linux, Windows, Mac
Programming Languages:
R
Added:
1/17/2017
Last Updated:
5/13/2021

Operations

Publications

Liao Y, Smyth GK, Shi W. The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Research. 2013;41(10):e108-e108. doi:10.1093/nar/gkt214. PMID:23558742. PMCID:PMC3664803.

Documentation

Downloads

Links