Fec
'Fec' is a fast and efficient error correction tool designed for third-generation sequencing data, which often contains long reads with relatively high error rates. Error correction is a critical preprocessing step in the analysis of these long-read sequences. 'Fec' specifically addresses the challenges associated with error correction in high-coverage scenarios, where the process can be time-consuming.
The tool leverages overlapping reads to identify and correct errors. 'Fec' operates in two rounds, utilizing a combination of large and small window sizes. In the first round, a relatively large window size is used to quickly identify sufficient overlaps between reads, enabling the correction of most reads. In the second round, a smaller window size is employed to identify additional overlaps, focusing on reads that may not have been adequately corrected in the first round.
'Fec' uses caching for base-level alignment information. When performing alignment, 'Fec' first checks a cache for existing alignments. If the alignment is found in the cache, 'Fec' retrieves it and deduces the second alignment from it. This caching strategy significantly reduces the need for repeated alignment calculations, making the error correction process more efficient.
The Authors have tested the tool on nine datasets, including both PacBio and nanopore sequencing data, and the results demonstrate that 'Fec' offers a substantial speed improvement compared to other error correction methods like MECAT, CANU, MINICNS, and NECAT, especially in scenarios with high-coverage data.
'Fec' is available for use either as a standalone tool or as part of an assembly pipeline.
Topic
Sequence assembly;Structure analysis;Workflows
Detail
Operation: Read mapping;Sequence assembly
Software interface: Command-line user interface
Language: C++
License: Not stated
Cost: Free
Version name: v1.0.1
Credit: The National Key Research and Development Program of China, the National Natural Science Foundation of China.
Input: -
Output: -
Contact: Jianxin Wang jxwang@mail.csu.edu.cn
Collection: -
Maturity: -
Publications
- Fec: a fast error correction method based on two-rounds overlapping and caching.
- Zhang J, et al. Fec: a fast error correction method based on two-rounds overlapping and caching. Fec: a fast error correction method based on two-rounds overlapping and caching. 2022; 38:4629-4632. doi: 10.1093/bioinformatics/btac565
- https://doi.org/10.1093/BIOINFORMATICS/BTAC565
- PMID: 35977383
- PMC: -
Download and documentation
Documentation: https://github.com/zhangjuncsu/Fec/blob/main/README.md
Home page: https://github.com/zhangjuncsu/Fec
< Back to DB search