Taipan
Taipan assembles short reads from second-generation sequencing using a hybrid greedy-extension and read-graph approach to produce high-quality contigs while optimizing computational resources.
Key Features:
- Hybrid Assembly Approach: Integrates greedy extension and graph-based assembly principles to balance assembly speed and decision quality.
- Greedy Extensions for Contig Construction: Uses greedy extensions to extend overlapping reads into contigs for rapid initial assembly.
- Incorporation of Read Graphs: Constructs portions of the read graph at each assembly step to inform subsequent assembly decisions without building the full graph.
- Resource Efficiency: Produces assemblies comparable in quality to graph-based tools such as Edena and Velvet while requiring moderate computational resources.
Scientific Applications:
- De novo genome assembly: Assembles genomes from short reads generated by second-generation sequencing technologies.
- Large-scale short-read datasets: Processes massive short-read datasets to deliver high-quality assemblies with constrained computational cost.
- Genomics and evolutionary biology: Provides contiguous sequences suitable for analyses in genomics and evolutionary biology that require accurate assemblies.
Methodology:
Taipan applies greedy extensions to build contigs while selectively constructing portions of the read graph at each assembly step to guide assembly decisions.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Mac
- Programming Languages:
- C
- Added:
- 1/13/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Schmidt B, Sinha R, Beresford-Smith B, Puglisi SJ. A fast hybrid short read fragment assembly algorithm. Bioinformatics. 2009;25(17):2279-2280. doi:10.1093/bioinformatics/btp374. PMID:19535537.
PMID: 19535537