BCFtools
BCFtools processes and analyzes variant-call data by reading and writing the Variant Call Format (VCF) and binary BCF to support variant calling, filtering, querying, statistical analysis, and effect analysis in high-throughput sequencing workflows.
Key Features:
- VCF/BCF I/O: Reads and writes VCF and BCF files, including BGZF-compressed VCFs/BCFs.
- Format conversion: Converts between VCF and BCF file formats.
- Sorting: Sorts variant records within VCF/BCF files.
- Querying and filtering: Performs record-level queries and filtering of variant calls.
- Statistical analysis: Computes statistics on variant data.
- Variant calling: Performs variant calling from sequencing data.
- Effect analysis: Performs analysis of variant effects.
- Interoperability: Integrates with SAMtools and processes data alongside SAM-format read alignments.
- Sequencing platform support: Operates on variant data derived from high-throughput sequencing of short and long reads.
Scientific Applications:
- Population-scale variant analysis: Analysis of variant datasets from large projects such as the 1000 Genomes Project.
- Variant discovery and genotyping: Calling and genotyping variants from high-throughput sequencing data.
- Variant filtering and selection: Filtering and querying variants for downstream association, functional, or population-genetic analyses.
- Effect interpretation: Assessing predicted effects of variants for functional genomics studies.
Methodology:
Operations explicitly include reading/writing VCF and BCF (including BGZF-compressed files), conversion between VCF and BCF, sorting, record-level querying and filtering, statistical computations, variant calling, effect analysis, and interoperability with SAMtools for SAM-format alignments.
Topics
Collections
Details
- License:
- MIT
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- command-line tool, workflow
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- C
- Added:
- 8/24/2015
- Last Updated:
- 6/3/2025
Operations
Data Inputs & Outputs
Data handling
Inputs
Outputs
Publications
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078-2079. doi:10.1093/bioinformatics/btp352. PMID:19505943. PMCID:PMC2723002.
Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10(2). doi:10.1093/gigascience/giab008. PMID:33590861. PMCID:PMC7931819.
Documentation
Downloads
- Downloads pagehttp://www.htslib.org/download/