gapFinisher

gapFinisher is an automated pipeline for gap filling, a crucial yet challenging step in de novo genome assembly. Gaps, that is, stretches of unknown sequence, persist in many published genomes in public databases, particularly in large genomes. Although various computational tools address the gap filling problem, many fall short in the reliability and correctness of their output.
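To make the notion of a gap concrete, the sketch below locates runs of unknown bases in scaffold sequences. It assumes the common convention that gaps are written as runs of `N` in FASTA files; the helper names `read_fasta` and `find_gaps` are hypothetical and are not part of gapFinisher, SSPACE-LongRead, or FGAP.

```python
import re
from typing import Iterable, Iterator, List, Tuple

# Assumption: a gap is any run of one or more N/n characters.
GAP = re.compile(r"[Nn]+")


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pairs from FASTA-formatted lines, one record at a time."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            header = line[1:].split()
            name, chunks = (header[0] if header else ""), []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def find_gaps(sequence: str) -> List[Tuple[int, int]]:
    """Return 0-based, half-open (start, end) coordinates of every gap."""
    return [(m.start(), m.end()) for m in GAP.finditer(sequence)]


if __name__ == "__main__":
    # Illustrative input only; a real run would pass an open file handle instead.
    demo = [">scaffold1", "ACGTNNNNNACGT", "TTNNAC", ">scaffold2", "ACGTACGT"]
    for name, seq in read_fasta(demo):
        gaps = find_gaps(seq)
        total = sum(end - start for start, end in gaps)
        print(name, len(gaps), total)
    # prints:
    # scaffold1 2 7
    # scaffold2 0 0
```

Running such a count before and after gap filling gives a quick, tool-independent check of how many gaps were closed.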

gapFinisher builds on the scaffolder SSPACE-LongRead, which uses long reads from multiple third-generation sequencing platforms to find links between contigs and combine them. gapFinisher processes the SSPACE-LongRead output to fill gaps efficiently after scaffolding, through controlled use of the previously published gap filling tool FGAP. It runs on all standard Linux/UNIX command lines.

In a comparison with two other published gap filling tools, PBJelly and GMcloser, gapFinisher filled gaps in draft genomes quickly and reliably. Its serial design lets gapFinisher scale from prokaryote genomes to larger genomes without increasing the computational footprint, which makes it suitable for genome assembly projects of varying size.

Topic

Sequence assembly;Workflows;Whole genome sequencing

Detail

  • Operation: Genome indexing;Read binning;Scaffolding

  • Software interface: Command-line user interface

  • Language: Shell

  • License: GNU General Public License v3

  • Cost: -

  • Version name: -

  • Credit: Jane and Aatos Erkko Foundation and Helsinki University Integrated Life Sciences doctoral programme.

  • Input: -

  • Output: -

  • Contact: Juhana I. Kammonen juhana.kammonen@helsinki.fi

  • Collection: -

  • Maturity: -

Publications

  • gapFinisher: A reliable gap filling pipeline for SSPACE-LongRead scaffolder output.
  • Kammonen JI, et al. gapFinisher: A reliable gap filling pipeline for SSPACE-LongRead scaffolder output. PLoS ONE. 2019; 14:e0216885. doi: 10.1371/journal.pone.0216885
  • https://doi.org/10.1371/JOURNAL.PONE.0216885
  • PMID: 31498807
  • PMC: PMC6733440

Download and documentation

