GraphAligner

GraphAligner aligns long, error-prone sequencing reads to genome graphs to enable efficient and accurate mapping for applications such as error correction, genome assembly, and variant genotyping in pan-genome analyses.


Key Features:

  • Speed and Efficiency: Achieves approximately 12× faster alignment while using about one-fifth of the memory compared with similar tools, with performance comparable to aligning reads to linear reference genomes.
  • Accuracy: Produces substantially improved alignment accuracy, yielding nearly threefold improvements in error-correction applications over current methodologies.
  • Versatility: Handles long reads with high error rates, enabling alignment to complex graph-based representations of genomes.

Scientific Applications:

  • Error Correction: Aligns long reads to genome graphs to support correction of sequencing errors in long-read datasets.
  • Genome Assembly: Maps reads onto graph structures that represent genetic diversity to facilitate genome assembly.
  • Variant Genotyping: Enables genotyping of variants within pan-genome contexts by aligning reads to graphs that encode population variation.

Methodology:

GraphAligner employs a seed-and-extend algorithm tailored for genome graphs.

Topics

Details

License:
MIT
Tool Type:
command-line tool
Programming Languages:
C++
Added:
1/9/2020
Last Updated:
12/3/2020

Operations

Data Inputs & Outputs

De-novo assembly

Publications

Rautiainen M, Marschall T. GraphAligner: Rapid and Versatile Sequence-to-Graph Alignment. Unknown Journal. 2019. doi:10.1101/810812.

Links