DeepREx
DeepREx predicts per-residue solvent exposure to characterize protein–solvent interactions and identify regions important for maintaining functional integrity during surface engineering when structural data are incomplete or absent.
Key Features:
- Deep learning-based residue classification: A deep learning algorithm classifies each residue in a protein sequence as buried or exposed.
- Training dataset from Protein Data Bank: The model is trained on a high-quality, non-redundant dataset derived from the Protein Data Bank comprising 2332 monomeric protein chains.
- Independent benchmarking: Performance is evaluated against an independent test set of 200 unrelated sequences, reporting state-of-the-art accuracy for solvent exposure prediction.
- Integrated sequence- and structure-derived features: Predictions incorporate residue conservation from multiple sequence alignments, local sequence hydrophobicity, residue flexibility predicted by MEDUSA, secondary structure prediction, and disordered region prediction from MobiDB-Lite3.0.
Scientific Applications:
- Protein engineering: Identification of exposed and buried regions to guide targeted mutations that modulate stability while preserving functional integrity.
- Analysis of proteins without experimental structures: Characterization of solvent exposure for sequences lacking complete structural data to inform experimental design and interpretation.
- Variant pathogenicity and functional annotation: Assessment of single-residue variations (SRVs) in human proteins, supporting analyses that link lower solvent accessibility to disease-associated variations.
Methodology:
Per-residue classification via a deep learning algorithm trained on a non-redundant PDB-derived dataset of 2332 monomeric chains and benchmarked on 200 unrelated sequences; integration of features from multiple sequence alignments (residue conservation), local sequence hydrophobicity calculations, residue flexibility predicted by MEDUSA, secondary structure prediction, and disordered region prediction using MobiDB-Lite3.0.
Topics
Collections
Details
- License:
- GPL-3.0
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- command-line tool, web application
- Operating Systems:
- Mac, Linux, Windows
- Programming Languages:
- Python
- Added:
- 5/15/2022
- Last Updated:
- 11/24/2024
Operations
Publications
Manfredi M, Savojardo C, Martelli PL, Casadio R. DeepREx-WS: A web server for characterising protein–solvent interaction starting from sequence. Computational and Structural Biotechnology Journal. 2021;19:5791-5799. doi:10.1016/j.csbj.2021.10.016. PMID:34765094. PMCID:PMC8566768.
Savojardo C, Manfredi M, Martelli PL, Casadio R. Solvent Accessibility of Residues Undergoing Pathogenic Variations in Humans: From Protein Structures to Protein Sequences. Frontiers in Molecular Biosciences. 2021;7. doi:10.3389/fmolb.2020.626363. PMID:33490109. PMCID:PMC7817970.
Documentation
Downloads
- Container filehttps://hub.docker.com/r/bolognabiocomp/deeprex
- Source codehttps://github.com/BolognaBiocomp/deeprex