gutSMASH

gutSMASH identifies primary metabolic gene clusters (MGCs) in the human gut microbiota to predict and functionally profile microbial metabolic pathways that produce metabolites influencing host health.


Key Features:

  • Automated identification: Detects known and putative primary metabolic gene clusters encoded within microbial genomes.
  • Pathway prediction: Predicts 41 different known pathways encompassing bioenergetics and other primary metabolic processes and highlights putative pathways as discovery candidates.
  • Functional profiling: Provides functional profiling of predicted MGCs to infer metabolic capabilities related to metabolite production.
  • Input formats: Accepts GenBank assembly accession numbers or genome files in FASTA or GenBank formats.
  • Additional analyses: Offers optional analyses to further characterize predicted MGCs.

Scientific Applications:

  • Microbiome-Derived Metabolites: Investigating molecules produced by anaerobic bacteria that affect the host directly or indirectly.
  • Novel Pathway Discovery: Identifying candidate new metabolic pathways from putative MGCs.
  • Functional Genomics: Analyzing the functional potential of gut microbiomes to understand their roles in health and disease.

Methodology:

Accepts genomic input (GenBank assembly accession or FASTA/GenBank files), performs automated identification and functional profiling of primary MGCs, predicts 41 known pathways including bioenergetics and other primary metabolic processes, highlights putative pathways, and compares input genomes against a comprehensive database of known gene clusters using bioinformatics techniques.

Topics

Details

License:
AGPL-3.0
Tool Type:
web application
Programming Languages:
Python
Added:
9/8/2021
Last Updated:
11/24/2024

Operations

Publications

Pascal Andreu V, Roel-Touris J, Dodd D, Fischbach MA, Medema MH. The gutSMASH web server: automated identification of primary metabolic gene clusters from the gut microbiota. Nucleic Acids Research. 2021;49(W1):W263-W270. doi:10.1093/nar/gkab353. PMID:34019648. PMCID:PMC8262752.

PMID: 34019648
PMCID: PMC8262752
Funding: - DARPA: HR0011-15-C-0084, HR0112020030 - NIH: DP1 DK113598, K08 DK110335, P01 HL147823, R01 DK101674 - European Research Council: 948770-DECIPHER

Documentation

Links