SR-ASM

SR-ASM assembles short DNA sequences produced by 454 pyrosequencing into longer contigs for genome assembly.


Key Features:

  • Heuristic Graph-Based Approach: Utilizes a heuristic method grounded in a modified graph model originally proposed for DNA sequencing by hybridization to assemble short reads into contigs.
  • Temporary Compression of Input Sequences: Temporarily compresses input sequences to reduce computational complexity and enhance processing speed without compromising accuracy.
  • Fast Multiple-Alignment Heuristics: Incorporates a rapid multiple-alignment heuristic that leverages the specific presentation and coding of data produced by 454 sequencing.

Scientific Applications:

  • Raw 454 data analysis: Validated on raw 454 sequencing data, demonstrating robustness and reliability.
  • Prochlorococcus marinus whole-genome assembly: Applied to whole-genome sequencing of Prochlorococcus marinus (1.84 Mbp).
  • Homo sapiens chromosome segment assembly: Applied to a segment of chromosome 15 from Homo sapiens.
  • Comparative benchmarking: In comparative analyses against other publicly available DNA assembly algorithms, produced a higher number of contigs and longer contigs with high similarity to reference genome sequences.

Methodology:

SR-ASM applies a heuristic modified graph model derived from sequencing-by-hybridization approaches, temporary compression of input sequences, and a fast multiple-alignment heuristic that leverages the presentation and coding of 454 pyrosequencing data.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
C++
Added:
1/13/2017
Last Updated:
11/25/2024

Operations

Publications

Blazewicz J, Bryja M, Figlerowicz M, Gawron P, Kasprzak M, Kirton E, Platt D, Przybytek J, Swiercz A, Szajkowski L. Whole genome assembly from 454 sequencing output via modified DNA graph concept. Computational Biology and Chemistry. 2009;33(3):224-230. doi:10.1016/j.compbiolchem.2009.04.005. PMID:19477687.

Documentation