gapFinisher
gapFinisher closes gaps in genome assemblies produced by SSPACE-LongRead by using long reads from third-generation sequencing platforms to improve completeness of de novo draft genomes.
Key Features:
- Integration with SSPACE-LongRead: Processes scaffold outputs from SSPACE-LongRead and uses sequence information from long reads generated by third-generation sequencing platforms to close gaps within scaffolds.
- Automated Pipeline: Automates gap-filling steps into a reproducible pipeline without manual intervention.
- Reliability and Correctness: Implements a controlled application of FGAP to prioritize accurate and dependable gap closure.
- Performance and Scalability: Uses a serial design that scales from prokaryotic genomes to larger genomes and has been reported to outperform PBJelly and GMcloser in speed and reliability.
- Platform Compatibility: Executes on standard Linux/UNIX command lines.
Scientific Applications:
- Gap closure in de novo assemblies: Closes unknown sequences in draft genomes and scaffolds produced by SSPACE-LongRead.
- Improved downstream analyses: Increases completeness for gene annotation, comparative genomics, and evolutionary studies by reducing scaffold gaps.
- Applicability across taxa: Applicable to microbial (prokaryotic) genomes and larger plant and animal genome projects.
Methodology:
Processes SSPACE-LongRead scaffold outputs and uses sequence information from long reads to perform gap filling via a controlled application of FGAP following a serial design.
Topics
Details
- Programming Languages:
- Shell
- Added:
- 11/14/2019
- Last Updated:
- 1/17/2021
Operations
Publications
Kammonen JI, Smolander O, Paulin L, Pereira PAB, Laine P, Koskinen P, Jernvall J, Auvinen P. gapFinisher: A reliable gap filling pipeline for SSPACE-LongRead scaffolder output. PLOS ONE. 2019;14(9):e0216885. doi:10.1371/journal.pone.0216885. PMID:31498807. PMCID:PMC6733440.
PMID: 31498807
PMCID: PMC6733440
Funding: - Jane ja Aatos Erkon Säätiö: 04-2013, 05-2017
- Integrated Life Sciences Doctoral Programme (ILS): 3-2016