BWA
BWA aligns short and long sequencing reads to large reference genomes (e.g., human) to produce accurate mappings for downstream analyses such as variant detection.
Key Features:
- Algorithmic Efficiency: Implements the Burrows-Wheeler Transform (BWT) with backward search to enable rapid, memory-efficient alignment, reported to be ~10–20× faster than MAQ while maintaining similar accuracy.
- Versatility in Read Types: Supports base-space reads from Illumina and color-space reads from AB SOLiD, accommodating multiple sequencing technologies.
- Handling of Indels and Mismatches: Accommodates mismatches and gaps (indels) to improve accuracy for longer and error-prone reads.
- Long-read Mode (BWA-SW): Provides a Smith–Waterman–based mode (BWA-SW) for high-accuracy alignments of longer sequences up to ~1 Mb.
- Output Format: Produces alignments in SAM (Sequence Alignment/Map) format for integration with downstream tools such as SAMtools.
- Scalability: Engineered to handle large-scale resequencing projects involving hundreds of individuals while managing computational resources.
Scientific Applications:
- Whole Genome Sequencing (WGS): Maps reads from diverse sequencing platforms for genome-wide analyses.
- Variant Calling: Provides aligned reads in SAM format as input for variant detection workflows and genotyping.
- Comparative Genomics: Enables sensitive mapping for comparative studies across genomes, including complex genomes such as Plasmodium falciparum.
Methodology:
BWA uses the Burrows-Wheeler Transform with backward search for alignment of short and long reads and includes a Smith–Waterman based BWA-SW mode for long-sequence alignment up to ~1 Mb.
Topics
Collections
Details
- License:
- MIT
- Maturity:
- Mature
- Tool Type:
- command-line tool, workflow
- Operating Systems:
- Linux, Mac
- Programming Languages:
- C
- Added:
- 1/13/2017
- Last Updated:
- 11/25/2024
Operations
Data Inputs & Outputs
Genome indexing
Outputs
Read mapping
Inputs
Outputs
Generation
Outputs
Generation
Outputs
Publications
Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25(14):1754-1760. doi:10.1093/bioinformatics/btp324. PMID:19451168. PMCID:PMC2705234.
Hatem A, Bozdağ D, Toland AE, Çatalyürek ÜV. Benchmarking short sequence mapping tools. BMC Bioinformatics. 2013;14(1). doi:10.1186/1471-2105-14-184. PMID:23758764. PMCID:PMC3694458.
Thankaswamy-Kosalai S, Sen P, Nookaew I. Evaluation and assessment of read-mapping by multiple next-generation sequencing aligners based on genome-wide characteristics. Genomics. 2017;109(3-4):186-191. doi:10.1016/j.ygeno.2017.03.001. PMID:28286147.
Caboche S, Audebert C, Lemoine Y, Hot D. Comparison of mapping algorithms used in high-throughput sequencing: application to Ion Torrent data. BMC Genomics. 2014;15(1):264. doi:10.1186/1471-2164-15-264. PMID:24708189. PMCID:PMC4051166.
Otto C, Stadler PF, Hoffmann S. Lacking alignments? The next-generation sequencing mapper segemehl revisited. Bioinformatics. 2014;30(13):1837-1843. doi:10.1093/bioinformatics/btu146. PMID:24626854.
Li H, Durbin R. Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics. 2010;26(5):589-595. doi:10.1093/bioinformatics/btp698. PMID:20080505. PMCID:PMC2828108.
Documentation
Downloads
- Container filehttps://docker-ui.genouest.org/app/#/container/bioconda/bwa
- Container filehttps://anaconda.org/bioconda/bwa
- Tool wrapper (CWL)https://gitlab.com/sibyllewohlgemuth/cwl_files/raw/master/bwa_index.cwl
- Tool wrapper (CWL)https://gitlab.com/sibyllewohlgemuth/cwl_files/raw/master/bwa_mem.cwl