TextFormats
TextFormats is an open-source software library and toolset that simplifies the definition and parsing of text formats commonly used in bioinformatics. Researchers can create specifications for new text formats using a simple data description format and an interactive wizard. TextFormats can also automatically generate example data and test specifications for correctness. Once a format specification exists, data in that format can be parsed and written from various programming languages such as Nim, Python, and C/C++, as well as through command-line and graphical user interface tools. Although TextFormats is general-purpose software, its main target application is bioinformatics, where new techniques, software tools, and data types often require the definition of new formats. The library includes specifications for several existing standard bioinformatics formats to facilitate their use.
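The specification-driven workflow described above (declare a format once, then parse and write data from that declaration) can be illustrated with a minimal Python sketch. This is not the TextFormats API and does not use its specification syntax. The names SPEC, decode and encode, and the three-field tab-separated layout, are hypothetical and chosen only for illustration.

```python
# Hypothetical illustration only: this is NOT the TextFormats API or its
# specification syntax. It mimics the idea of driving both parsing and
# writing from a single declarative description of a text format.

# The "specification": a separator plus an ordered list of
# (field name, converter) pairs for one line of text.
SPEC = {
    "separator": "\t",
    "fields": [
        ("seqname", str),
        ("start", int),
        ("end", int),
    ],
}


def decode(line, spec):
    """Parse one line into a dict, following the field list in the spec."""
    parts = line.rstrip("\n").split(spec["separator"])
    if len(parts) != len(spec["fields"]):
        raise ValueError(
            f"expected {len(spec['fields'])} fields, got {len(parts)}"
        )
    # Each converter raises ValueError itself if the text is not valid.
    return {
        name: convert(text)
        for (name, convert), text in zip(spec["fields"], parts)
    }


def encode(record, spec):
    """Write a dict back to a line, using the same spec."""
    return spec["separator"].join(
        str(record[name]) for name, _ in spec["fields"]
    )


record = decode("chr1\t100\t250", SPEC)
print(record)
print(encode(record, SPEC))
```

Because the parser and the writer read the same description, a change to the format only has to be made in one place. The sketch shows that general design idea, which is the one the description above attributes to TextFormats specifications.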
Topic
Bioinformatics;Software engineering;Genomics
Detail
Operation: Parsing;Formatting;Editing
Software interface: Command-line interface
Language: Python,C,C++
License: Not stated
Cost: Free of charge
Version name: -
Credit: DFG
Input: -
Output: -
Contact: Giorgio Gonnella giorgio.gonnella@uni-goettingen.de
Collection: -
Maturity: -
Publications
- TextFormats: Simplifying the definition and parsing of text formats in bioinformatics.
- DFG Grant GO "Automated characterization of microbial genomes and metagenomes by collection and verification of association rules".
- https://doi.org/10.1371/journal.pone.0268910
- PMID: 35617194
- PMC: PMC9135226
Download and documentation
Documentation: https://github.com/ggonnella/textformats/tree/main/manuals
Home page: https://github.com/ggonnella/textformats