BamHash

BamHash is a software tool for large resequencing projects that uses a checksum-based method to ensure that read pairs in FASTQ files match exactly the read pairs stored in BAM files. It can be used to verify the integrity of the files stored and discover any discrepancies, making it possible to determine if it is safe to delete the FASTQ files storing raw sequencing read after alignment, without the loss of data

Topic

Data quality management;DNA;Data management

Detail

  • Operation: Data handling

  • Software interface: Command-line user interface

  • Language: C++;C

  • License: GNU General Public License v3.0

  • Cost: Free

  • Version name: v2.0

  • Credit: -

  • Input: -

  • Output: -

  • Contact: Páll Melsted pmelsted@hi.is

  • Collection: -

  • Maturity: -

Publications

  • BamHash: a checksum program for verifying the integrity of sequence data.
  • Óskarsdóttir A, et al. BamHash: a checksum program for verifying the integrity of sequence data. BamHash: a checksum program for verifying the integrity of sequence data. 2016; 32:140-1. doi: 10.1093/bioinformatics/btv539
  • https://doi.org/10.1093/bioinformatics/btv539
  • PMID: 26363028
  • PMC: -

Download and documentation


< Back to DB search