Ragout
Ragout assembles bacterial genomes from high-throughput short-read sequencing assemblies by ordering and scaffolding contigs using multiple complete reference genomes and their evolutionary relationships to improve scaffold continuity.
Key Features:
- Multiple Reference Utilization: Employs multiple complete reference genomes and incorporates their evolutionary relationships via a phylogenetic tree to determine contig order.
- Genome Rearrangement Strategy: Applies genome rearrangement approaches to model structural differences among references and guide the ordering and assembly of contigs into scaffolds.
- Assembly Graph and Multi-Scale Synteny Blocks: Integrates the input assembly graph with multi-scale synteny blocks to address gaps caused by small contigs and to reduce assembly errors.
- Alternative to Long-Read Technologies: Provides a reference-based scaffolding alternative to long reads such as Pacific Biosciences (PacBio) or jumping libraries by leveraging existing complete genomes of related strains.
Scientific Applications:
- Improved bacterial genome assembly: Enhances draft assemblies from short-read sequencing for bacterial species with multiple related complete genomes, and in simulations and real datasets can yield a single high-quality scaffold per chromosome.
Methodology:
Uses multiple reference genomes and their phylogenetic relationships, applies genome rearrangement algorithms to order and scaffold contigs, and integrates the assembly graph with multi-scale synteny blocks to bridge gaps and reduce errors.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- Python
- Added:
- 8/3/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Kolmogorov M, Raney B, Paten B, Pham S. Ragout—a reference-assisted assembly tool for bacterial genomes. Bioinformatics. 2014;30(12):i302-i309. doi:10.1093/bioinformatics/btu280. PMID:24931998. PMCID:PMC4058940.