RResolver

RResolver improves de novo genome assembly by optimizing de Bruijn graph resolution of repeat regions in short-read sequencing data.


Key Features:

  • De Bruijn Graph Optimization: Operates on a de Bruijn graph produced by ABySS, using k-mers and adjusting the k value toward the read length to improve resolution of repeat regions.
  • Bloom Filter Integration: Employs a Bloom filter of sequencing reads to rapidly assess path support at branching points and remove paths with insufficient support.
  • Scalability and Performance: Demonstrated assembly of a human genome with ABySS in approximately 26 minutes using 48 threads and 60 GiB of memory, enabling processing of large genomes.
  • Improved Assembly Outcomes: Increases scaffold contiguity (NGA50) by up to 15% and reduces misassemblies by up to 12%.
  • Integration with ABySS: Functions within the ABySS assembly pipeline to enhance graph traversal and provide a simplified genomic representation for downstream algorithms.
  • Targeting High-Variation Genomes: Addresses assembly challenges in highly variable genomes, such as those encountered in cancer studies, by optimizing repeat resolution in short-read data.

Scientific Applications:

  • Cancer genomics: Improves de novo assemblies of highly variable cancer genomes to support variant and structural analyses.
  • Large-genome de novo assembly: Enhances contiguity and accuracy of de novo assemblies for large genomes generated from short-read sequencing and ABySS.
  • Downstream genomic analyses: Produces more contiguous and accurate assemblies that facilitate downstream analyses requiring high scaffold continuity.

Methodology:

Operates on an ABySS-derived de Bruijn graph built from k-mers of short reads, adjusts k toward read length to resolve repeats, and uses a Bloom filter of sequencing reads to evaluate path support at branching points and remove unsupported paths.

Topics

Details

License:
Not licensed
Cost:
Free of charge
Tool Type:
command-line tool
Operating Systems:
Mac, Linux, Windows
Added:
9/3/2022
Last Updated:
11/24/2024

Operations

Publications

Nikolić V, Afshinfard A, Chu J, Wong J, Coombe L, Nip KM, Warren RL, Birol I. RResolver: efficient short-read repeat resolution within ABySS. BMC Bioinformatics. 2022;23(1). doi:10.1186/s12859-022-04790-z. PMID:35729491. PMCID:PMC9215042.

PMID: 35729491
PMCID: PMC9215042
Funding: - Genome British Columbia: [281ANV] - Genome Canada: [281ANV] - National Institutes of Health: [2R01HG007182-04A1]

Related Tools

abyss
Relation: uses