BCFtools

BCFtools processes and analyzes variant-call data by reading and writing the Variant Call Format (VCF) and binary BCF to support variant calling, filtering, querying, statistical analysis, and effect analysis in high-throughput sequencing workflows.


Key Features:

  • VCF/BCF I/O: Reads and writes VCF and BCF files, including BGZF-compressed VCFs/BCFs.
  • Format conversion: Converts between VCF and BCF file formats.
  • Sorting: Sorts variant records within VCF/BCF files.
  • Querying and filtering: Performs record-level queries and filtering of variant calls.
  • Statistical analysis: Computes statistics on variant data.
  • Variant calling: Performs variant calling from sequencing data.
  • Effect analysis: Performs analysis of variant effects.
  • Interoperability: Integrates with SAMtools and processes data alongside SAM-format read alignments.
  • Sequencing platform support: Operates on variant data derived from high-throughput sequencing of short and long reads.

Scientific Applications:

  • Population-scale variant analysis: Analysis of variant datasets from large projects such as the 1000 Genomes Project.
  • Variant discovery and genotyping: Calling and genotyping variants from high-throughput sequencing data.
  • Variant filtering and selection: Filtering and querying variants for downstream association, functional, or population-genetic analyses.
  • Effect interpretation: Assessing predicted effects of variants for functional genomics studies.

Methodology:

Operations explicitly include reading/writing VCF and BCF (including BGZF-compressed files), conversion between VCF and BCF, sorting, record-level querying and filtering, statistical computations, variant calling, effect analysis, and interoperability with SAMtools for SAM-format alignments.

Topics

Collections

Details

License:
MIT
Maturity:
Mature
Cost:
Free of charge
Tool Type:
command-line tool, workflow
Operating Systems:
Linux, Windows, Mac
Programming Languages:
C
Added:
8/24/2015
Last Updated:
6/3/2025

Operations

Data Inputs & Outputs

Data handling

Publications

Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078-2079. doi:10.1093/bioinformatics/btp352. PMID:19505943. PMCID:PMC2723002.

Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10(2). doi:10.1093/gigascience/giab008. PMID:33590861. PMCID:PMC7931819.

PMID: 33590861
PMCID: PMC7931819
Funding: - Wellcome Trust: 206194

Documentation

Downloads

Links

Related Tools

htslib
Relation: uses