BWA

BWA aligns short and long sequencing reads to large reference genomes (e.g., human) to produce accurate mappings for downstream analyses such as variant detection.


Key Features:

  • Algorithmic Efficiency: Implements the Burrows-Wheeler Transform (BWT) with backward search to enable rapid, memory-efficient alignment, reported to be ~10–20× faster than MAQ while maintaining similar accuracy.
  • Versatility in Read Types: Supports base-space reads from Illumina and color-space reads from AB SOLiD, accommodating multiple sequencing technologies.
  • Handling of Indels and Mismatches: Accommodates mismatches and gaps (indels) to improve accuracy for longer and error-prone reads.
  • Long-read Mode (BWA-SW): Provides a Smith–Waterman–based mode (BWA-SW) for high-accuracy alignments of longer sequences up to ~1 Mb.
  • Output Format: Produces alignments in SAM (Sequence Alignment/Map) format for integration with downstream tools such as SAMtools.
  • Scalability: Engineered to handle large-scale resequencing projects involving hundreds of individuals while managing computational resources.

Scientific Applications:

  • Whole Genome Sequencing (WGS): Maps reads from diverse sequencing platforms for genome-wide analyses.
  • Variant Calling: Provides aligned reads in SAM format as input for variant detection workflows and genotyping.
  • Comparative Genomics: Enables sensitive mapping for comparative studies across genomes, including complex genomes such as Plasmodium falciparum.

Methodology:

BWA uses the Burrows-Wheeler Transform with backward search for alignment of short and long reads and includes a Smith–Waterman based BWA-SW mode for long-sequence alignment up to ~1 Mb.

Topics

Collections

Details

License:
MIT
Maturity:
Mature
Tool Type:
command-line tool, workflow
Operating Systems:
Linux, Mac
Programming Languages:
C
Added:
1/13/2017
Last Updated:
11/25/2024

Operations

Data Inputs & Outputs

Read mapping

Other operations do not define inputs or outputs.

Publications

Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25(14):1754-1760. doi:10.1093/bioinformatics/btp324. PMID:19451168. PMCID:PMC2705234.

Hatem A, Bozdağ D, Toland AE, Çatalyürek ÜV. Benchmarking short sequence mapping tools. BMC Bioinformatics. 2013;14(1). doi:10.1186/1471-2105-14-184. PMID:23758764. PMCID:PMC3694458.

Thankaswamy-Kosalai S, Sen P, Nookaew I. Evaluation and assessment of read-mapping by multiple next-generation sequencing aligners based on genome-wide characteristics. Genomics. 2017;109(3-4):186-191. doi:10.1016/j.ygeno.2017.03.001. PMID:28286147.

PMID: 28286147
Funding: - Vetenskapsrådet: VR-2013-4504

Caboche S, Audebert C, Lemoine Y, Hot D. Comparison of mapping algorithms used in high-throughput sequencing: application to Ion Torrent data. BMC Genomics. 2014;15(1):264. doi:10.1186/1471-2164-15-264. PMID:24708189. PMCID:PMC4051166.

Otto C, Stadler PF, Hoffmann S. Lacking alignments? The next-generation sequencing mapper segemehl revisited. Bioinformatics. 2014;30(13):1837-1843. doi:10.1093/bioinformatics/btu146. PMID:24626854.

Li H, Durbin R. Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics. 2010;26(5):589-595. doi:10.1093/bioinformatics/btp698. PMID:20080505. PMCID:PMC2828108.

Documentation

Downloads