DSM
DSM (Distributed String Mining) is a software framework for content-based exploration and retrieval of whole-metagenome sequencing samples. It efficiently extracts informative sequence k-mers from a pool of metagenomic samples and uses them to measure the dissimilarity between two samples. The method is unsupervised but achieves significant enrichment for diseased gut samples in query results and high accuracy in discriminating between different body sites.
Topic
Sequencing;Metagenomics
Detail
Operation: Query and retrieval
Software interface: Command-line interface
Language: MATLAB,C++
License: Not stated
Cost: Free of charge
Version name: 20140113
Credit: The Academy of Finland, the Finnish Centre of Excellence in Computational Inference Research COIN.
Input: -
Output: -
Contact: -
Collection: -
Maturity: -
Publications
- Exploration and retrieval of whole-metagenome sequencing samples.
- Seth S, et al. Exploration and retrieval of whole-metagenome sequencing samples. Exploration and retrieval of whole-metagenome sequencing samples. 2014; 30:2471-9. doi: 10.1093/bioinformatics/btu340
- https://doi.org/10.1093/bioinformatics/btu340
- PMID: 24845653
- PMC: PMC4230234
Download and documentation
Source: https://github.com/HIITMetagenomics/dsm-framework/releases/tag/bioinformatics2014
Documentation: https://github.com/HIITMetagenomics/dsm-framework/blob/master/README.md
Home page: https://github.com/HIITMetagenomics/dsm-framework
< Back to DB search