MVIB
MVIB learns joint stochastic encodings from species-relative abundances and strain-level markers derived from shotgun metagenomic sequencing to predict human disease states.
Key Features:
- Multimodal Integration: MVIB combines species-relative abundances and strain-level markers derived from shotgun metagenomic sequencing into a single predictive framework.
- Deep Learning Architecture: It employs an information-theoretic deep neural network to compute a joint stochastic encoding of heterogeneous input modalities.
- Performance and Efficiency: Reported Receiver Operating Characteristic Area Under Curve (ROC AUC) values range from 0.80 to 0.95 across disease cohorts, with average training times under 1.4 seconds.
- Interpretability: The model uses a saliency technique to identify the microbial species and strain-level markers most relevant to predictions.
- Scalability and Generalization: MVIB can incorporate additional modalities such as metabolomic data from mass spectrometry and shows cross-study generalization comparable to Random Forest when trained and tested on different cohorts of the same disease.
Scientific Applications:
- Disease prediction and biomarker discovery: Analysis of human gut metagenomic samples from 11 publicly available disease cohorts spanning six different conditions to predict diseases and identify associated microbial markers.
Methodology:
Training deep neural networks using a variational information bottleneck to maximize task-relevant information while minimizing irrelevant details and computing a joint stochastic encoding, with outputs interpreted via a saliency technique.
Topics
Details
- License:
- MIT
- Cost:
- Free of charge
- Tool Type:
- library
- Operating Systems:
- Mac, Linux, Windows
- Programming Languages:
- Python
- Added:
- 7/26/2022
- Last Updated:
- 11/24/2024
Operations
Publications
Grazioli F, Siarheyeu R, Alqassem I, Henschel A, Pileggi G, Meiser A. Microbiome-based disease prediction with multimodal variational information bottlenecks. PLOS Computational Biology. 2022;18(4):e1010050. doi:10.1371/journal.pcbi.1010050. PMID:35404958. PMCID:PMC9022840.