DeepSacon
DeepSacon predicts solvent accessibility and contact number from one-dimensional amino acid sequences to provide constraints for protein folding models and three-dimensional (3D) protein structure construction.
Key Features:
- Prediction targets: Solvent accessibility (three-state and 15-state) and contact number predicted from one-dimensional amino acid sequences.
- Model architecture: Deep neural networks employing a stacked autoencoder architecture with dropout to enhance robustness and prevent overfitting.
- Performance on monomeric soluble globular proteins: On a dataset of 5,729 monomeric soluble globular proteins it achieves three-state accuracy of 0.70, 15-state accuracy of 0.33, and a Pearson Correlation Coefficient (PCC) of 0.74 for contact number predictions.
- CASP11 benchmark performance: On the CASP11 dataset it attains three-state accuracy of 0.68 and PCC of 0.69 for both solvent accessibility and contact number predictions.
Scientific Applications:
- Protein folding constraints: Provide quantitative constraints for protein folding models derived from predicted solvent accessibility and contact number.
- 3D structure modeling: Aid construction and refinement of three-dimensional protein structures using predicted residue-level structural properties.
- Stability and interaction analysis: Inform analyses of protein stability and intermolecular interactions through solvent accessibility and contact number information.
Methodology:
DeepSacon applies deep neural networks using a stacked autoencoder architecture combined with dropout to predict solvent accessibility and contact number from one-dimensional amino acid sequences.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux
- Added:
- 7/21/2018
- Last Updated:
- 11/25/2024
Operations
Data Inputs & Outputs
Protein property calculation
Outputs
Publications
Deng L, Fan C, Zeng Z. A sparse autoencoder-based deep neural network for protein solvent accessibility and contact number prediction. BMC Bioinformatics. 2017;18(S16). doi:10.1186/s12859-017-1971-7. PMID:29297299. PMCID:PMC5751690.