Bazam
Bazam extracts the original paired FASTQ reads from BAM and CRAM alignment files so that they can be realigned to an updated reference genome, realigned with a different alignment method, or used in targeted region analyses.
Key Features:
- Paired FASTQ extraction: Extracts original paired FASTQ reads from BAM and CRAM alignment files.
- Selective regional extraction: Selectively extracts read pairs mapped to specified genomic regions.
- Realignment support: Produces FASTQ output suitable for realignment to updated reference genomes or with alternative alignment methods.
- Time efficiency: Reduces the time required for realignment by up to 90% compared to traditional approaches.
- Alignment comparison: Provides reads for realignment with different methods or references, enabling comparative analyses.
- Structural variant targeting: Allows extraction of reads from regions suspected to contain structural variants to aid detection and characterization.
- Region-specific quality control: Enables detailed quality control analyses by extracting reads from defined genomic loci.
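As an illustration of the selective regional extraction feature listed above, the following is a minimal Python sketch of one plausible selection rule: a pair is kept when at least one of its reads overlaps the requested interval, and both mates are then returned even if one maps elsewhere. The `Mapped` record, the `overlaps` and `select_pairs` functions, and the half-open coordinate convention are all hypothetical and chosen for this example; they are not Bazam's own code or API.

```python
from typing import Iterable, List, NamedTuple


class Mapped(NamedTuple):
    """Hypothetical, simplified alignment record (not a real BAM record)."""
    name: str   # read-pair name shared by both mates
    chrom: str  # reference sequence the read is mapped to
    start: int  # 0-based, inclusive
    end: int    # 0-based, exclusive


def overlaps(rec: Mapped, chrom: str, start: int, end: int) -> bool:
    """True if the read's mapped span intersects the half-open interval."""
    return rec.chrom == chrom and rec.start < end and start < rec.end


def select_pairs(records: Iterable[Mapped], chrom: str, start: int, end: int) -> List[Mapped]:
    """Keep every read whose pair has at least one mate overlapping the region."""
    records = list(records)
    wanted = {r.name for r in records if overlaps(r, chrom, start, end)}
    return [r for r in records if r.name in wanted]


reads = [
    Mapped("p1", "chr1", 100, 200),
    Mapped("p1", "chr5", 9000, 9100),  # mate mapped to another chromosome
    Mapped("p2", "chr1", 500, 600),
    Mapped("p2", "chr1", 700, 800),
]
selected = select_pairs(reads, "chr1", 150, 300)
```

Keeping the distant mate matters for use cases such as structural variant analysis, where the informative read of a pair is often the one that maps outside the region of interest.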
Scientific Applications:
- Targeted Region Analyses: Focuses analyses on specific genomic intervals by extracting paired reads mapped to those regions.
- Quality Control: Supports region-specific examination of sequencing quality using extracted reads.
- Structural Variant Calling: Provides targeted read sets from candidate loci to assist structural variant identification and characterization.
- Alignment Comparisons and Reanalysis: Enables realignment of existing data with alternative alignment methods or to updated reference genomes for comparative evaluation.
Methodology:
Bazam extracts the original paired FASTQ reads from BAM and CRAM files, optionally restricted to read pairs from specified genomic regions, and outputs FASTQ suitable for realignment to updated reference genomes or with alternative alignment methods.
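The general idea of recovering paired FASTQ from a coordinate-sorted alignment file can be sketched in Python as follows. This is a simplified illustration, not Bazam's implementation: it assumes the input contains only primary alignments, that each read name occurs exactly twice, and that pairing is done by holding a read in memory until its mate appears. The `Alignment` record and all function names are hypothetical. The one detail taken from the BAM format itself is that reads aligned to the reverse strand are stored reverse-complemented, so restoring the original read requires reverse-complementing the sequence and reversing the quality string.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator

_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


@dataclass(frozen=True)
class Alignment:
    """Hypothetical, simplified stand-in for one BAM/CRAM record."""
    name: str
    seq: str
    qual: str
    is_reverse: bool  # read aligned to the reverse strand
    is_read1: bool    # first read of the pair


def fastq_entry(rec: Alignment) -> str:
    """Format one read as FASTQ, restoring its as-sequenced orientation."""
    seq, qual = rec.seq, rec.qual
    if rec.is_reverse:
        seq = seq.translate(_COMPLEMENT)[::-1]
        qual = qual[::-1]
    suffix = "/1" if rec.is_read1 else "/2"
    return f"@{rec.name}{suffix}\n{seq}\n+\n{qual}\n"


def interleaved_fastq(records: Iterable[Alignment]) -> Iterator[str]:
    """Yield read 1 followed by read 2 as soon as both mates have been seen."""
    pending: Dict[str, Alignment] = {}
    for rec in records:
        mate = pending.pop(rec.name, None)
        if mate is None:
            pending[rec.name] = rec  # wait for the mate
            continue
        first, second = (rec, mate) if rec.is_read1 else (mate, rec)
        yield fastq_entry(first) + fastq_entry(second)


records = [
    Alignment("a", "ACGT", "IIII", False, True),
    Alignment("b", "AACC", "ABCD", True, False),
    Alignment("a", "GGTT", "FFFF", True, False),
    Alignment("b", "TTTT", "IIII", False, True),
]
pairs = list(interleaved_fastq(records))
```

Interleaved output of this kind can be streamed to an aligner that accepts interleaved paired reads, which avoids writing intermediate FASTQ files to disk.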
Details
- License: LGPL-2.1
- Maturity: Mature
- Cost: Free of charge
- Tool Type: command-line tool
- Operating Systems: Linux, Mac
- Programming Languages: Groovy
- Added: 8/3/2019
- Last Updated: 6/16/2020
Publications
Sadedin SP, Oshlack A. Bazam: a rapid method for read extraction and realignment of high-throughput sequencing data. Genome Biology. 2019;20(1). doi:10.1186/s13059-019-1688-1. PMID:30999943. PMCID:PMC6472072.
Downloads
- Downloads page: https://github.com/ssadedin/bazam/releases
Links
- Issue tracker: https://github.com/ssadedin/bazam/issues