DSM

DSM (Distributed String Mining) is a software framework for content-based exploration and retrieval of whole-metagenome sequencing samples. It efficiently extracts informative sequence k-mers from a pool of metagenomic samples and uses them to measure the dissimilarity between two samples. The method is unsupervised but achieves significant enrichment for diseased gut samples in query results and high accuracy in discriminating between different body sites.

Topic

Sequencing;Metagenomics

Detail

  • Operation: Query and retrieval

  • Software interface: Command-line interface

  • Language: MATLAB,C++

  • License: Not stated

  • Cost: Free of charge

  • Version name: 20140113

  • Credit: The Academy of Finland, the Finnish Centre of Excellence in Computational Inference Research COIN.

  • Input: -

  • Output: -

  • Contact: -

  • Collection: -

  • Maturity: -

Publications

  • Exploration and retrieval of whole-metagenome sequencing samples.
  • Seth S, et al. Exploration and retrieval of whole-metagenome sequencing samples. Exploration and retrieval of whole-metagenome sequencing samples. 2014; 30:2471-9. doi: 10.1093/bioinformatics/btu340
  • https://doi.org/10.1093/bioinformatics/btu340
  • PMID: 24845653
  • PMC: PMC4230234

Download and documentation


< Back to DB search