MAFCO

MAFCO compresses MAF (Multiple Alignment Format) files using a lossless algorithm optimized for whole-genome multiple alignments to reduce storage footprint while preserving alignment data integrity.


Key Features:

  • Lossless compression: Implements a lossless algorithm specifically tailored for MAF files to preserve all alignment information.
  • MAF specialization: Targets MAF files that contain alignments between entire genomes across multiple species.
  • Compression performance vs gzip: Achieves compression gains ranging from 34% to 57% compared to gzip depending on the dataset.
  • Improvement over dedicated methods: Reports an average 9% compression improvement relative to other dedicated methods on compatible datasets.

Scientific Applications:

  • Archival storage of whole-genome alignments: Reduces storage requirements for MAF files generated in whole-genome comparative analyses.
  • Management of large comparative genomics datasets: Facilitates handling and long-term retention of large multiple-alignment datasets without data loss.
  • Preservation for downstream analyses: Maintains complete alignment information for subsequent comparative genomics and evolutionary analyses.

Methodology:

Uses a lossless compression algorithm tailored to MAF (Multiple Alignment Format) files and evaluates performance against gzip and other dedicated compression methods.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
C
Added:
8/3/2017
Last Updated:
11/25/2024

Operations

Publications

Matos LMO, Neves AJR, Pratas D, Pinho AJ. MAFCO: A Compression Tool for MAF Files. PLOS ONE. 2015;10(3):e0116082. doi:10.1371/journal.pone.0116082. PMID:25816229. PMCID:PMC4376647.

Documentation

Links