Taipan

Taipan assembles short reads from second-generation sequencing using a hybrid greedy-extension and read-graph approach to produce high-quality contigs while optimizing computational resources.


Key Features:

  • Hybrid Assembly Approach: Integrates greedy extension and graph-based assembly principles to balance assembly speed and decision quality.
  • Greedy Extensions for Contig Construction: Uses greedy extensions to extend overlapping reads into contigs for rapid initial assembly.
  • Incorporation of Read Graphs: Constructs portions of the read graph at each assembly step to inform subsequent assembly decisions without building the full graph.
  • Resource Efficiency: Produces assemblies comparable in quality to graph-based tools such as Edena and Velvet while requiring moderate computational resources.

Scientific Applications:

  • De novo genome assembly: Assembles genomes from short reads generated by second-generation sequencing technologies.
  • Large-scale short-read datasets: Processes massive short-read datasets to deliver high-quality assemblies with constrained computational cost.
  • Genomics and evolutionary biology: Provides contiguous sequences suitable for analyses in genomics and evolutionary biology that require accurate assemblies.

Methodology:

Taipan applies greedy extensions to build contigs while selectively constructing portions of the read graph at each assembly step to guide assembly decisions.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux, Mac
Programming Languages:
C
Added:
1/13/2017
Last Updated:
11/25/2024

Operations

Publications

Schmidt B, Sinha R, Beresford-Smith B, Puglisi SJ. A fast hybrid short read fragment assembly algorithm. Bioinformatics. 2009;25(17):2279-2280. doi:10.1093/bioinformatics/btp374. PMID:19535537.