QVZ
QVZ is a lossy compression algorithm for quality values in genomic data files, such as FASTQ and SAM files. The proposed algorithm exhibits better rate-distortion performance than previously proposed algorithms, for several distortion metrics and for the lossless case. The user can define any quasi-convex distortion function to be minimized, a feature not supported by the previous algorithms. Additionally, QVZ-compressed data exhibit better performance in genotyping than data compressed with previously proposed algorithms.
Topic
DNA;Sequencing;Sequence analysis;Data quality management
Detail
Operation: Sequencing quality control
Software interface: Command-line user interface
Language: C
License: GNU General Public License v3.0
Cost: Free
Version name: -
Credit: Stanford Graduate Fellowships Program in Science and Engineering, the Basque Government, the Center for Science of Information (CSoI), NSF, NIH.
Input: -
Output: -
Contact: iochoa@stanford.edu;gmalysa@stanford.edu;mhernaez@stanford.edu
Collection: -
Maturity: -
Publications
- QVZ: lossy compression of quality values.
- Malysa G, et al. QVZ: lossy compression of quality values. QVZ: lossy compression of quality values. 2015; 31:3122-9. doi: 10.1093/bioinformatics/btv330
- https://doi.org/10.1093/bioinformatics/btv330
- PMID: 26026138
- PMC: PMC5856090
Download and documentation
Documentation: https://github.com/mikelhernaez/qvz#readme
Home page: https://github.com/mikelhernaez/qvz
< Back to DB search