Heap

Heap detects single nucleotide polymorphisms (SNPs) in low-coverage next-generation sequencing (NGS) data to provide sensitive and accurate variant calls for downstream analyses.


Key Features:

  • Low-Coverage Optimization: Optimized for low-coverage scenarios (e.g., ~7X coverage) to enhance SNP detection where read depth is limited.
  • Genotype-centric Calling: Performs genotype determination at each site and focuses calling on reliable positions to reduce false positives.
  • Minor-allele Support Filtering: Excludes sites with unreliable minor-allele support, specifically those supported by only one read or located at read ends.
  • Performance on RAD-seq: Achieves superior F-scores in low-coverage restriction-site associated DNA sequencing datasets from sorghum and rice individuals compared with existing tools.

Scientific Applications:

  • Genome-wide association studies (GWAS): Improves the quality and sensitivity of variant datasets used for association mapping from low-coverage NGS data.
  • Genomic prediction (GP): Enhances variant discovery for genomic prediction analyses where accurate SNP calls at low coverage benefit prediction models.

Methodology:

Aligns NGS reads to reference genome sequences and determines genotypes per site while excluding sites with insufficient minor-allele support (single-read support) or positional ambiguity at read ends prior to SNP calling.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux, Mac
Programming Languages:
Shell
Added:
5/30/2018
Last Updated:
11/25/2024

Operations

Publications

Kobayashi M, Ohyanagi H, Takanashi H, Asano S, Kudo T, Kajiya-Kanegae H, Nagano AJ, Tainaka H, Tokunaga T, Sazuka T, Iwata H, Tsutsumi N, Yano K. Heap: a highly sensitive and accurate SNP detection tool for low-coverage high-throughput sequencing data. DNA Research. 2017;24(4):397-405. doi:10.1093/dnares/dsx012. PMID:28498906. PMCID:PMC5737671.

Documentation