DMVFL-RSA

DMVFL-RSA predicts protein relative solvent accessibility (RSA) by applying a deep multi-view feature learning framework that integrates multiple neural network architectures and sequence-based features to improve RSA prediction accuracy.


Key Features:

  • Deep Multi-View Feature Learning Framework: Employs a DMVFL framework to learn complementary representations from multiple views of sequence information.
  • Neural Network Units: Integrates Bidirectional Long Short-Term Memory Recurrent Neural Network (BiLSTM RNN), Squeeze-and-Excitation Networks, and fully-connected hidden layers.
  • Sequence-Based Single-View Features: Incorporates Position-Specific Scoring Matrix (PSSM), Position-Specific Frequency Matrix (PSFM), predicted secondary structure, and roughly predicted three-state RSA probability as inputs.
  • Multiple Feedback Mechanism: Uses a customized feedback mechanism to extract and enhance discriminative information from single-view features.
  • Prediction Modes and Thresholds: Supports two-state (25%), three-state (9% and 36%), and four-state (4%, 25%, 50%) RSA classification thresholds.
  • Benchmark Evaluation and Metrics: Evaluated on TEST524 and CASP14-derived (CASP14set) datasets and reports high Pearson correlation coefficient values for real-valued RSA predictions.

Scientific Applications:

  • Two-State Prediction: Classifies residues as exposed or buried using an exposure threshold of 25%.
  • Three-State Prediction: Classifies residues into three exposure states using exposure thresholds of 9% and 36%.
  • Four-State Prediction: Classifies residues into four exposure states using exposure thresholds of 4%, 25%, and 50%.
  • Real-Valued RSA Prediction and Correlation Analysis: Produces real-valued RSA predictions assessed by Pearson correlation coefficient to quantify agreement with native RSA.
  • Benchmarking and Method Comparison: Enables performance comparison against state-of-the-art predictors on TEST524 and CASP14set datasets.

Methodology:

Integrates PSSM, PSFM, predicted secondary structure, and roughly predicted three-state RSA probability as single-view inputs into a DMVFL architecture combining BiLSTM RNN, Squeeze-and-Excitation networks, and fully-connected hidden layers, with a customized feedback mechanism; evaluated on TEST524 and CASP14set using Pearson correlation coefficient for real-valued RSA.

Topics

Details

Cost:
Free of charge
Tool Type:
web application, workflow
Operating Systems:
Linux
Programming Languages:
Python, Scheme
Added:
2/23/2022
Last Updated:
2/23/2022

Operations

Publications

Fan X, Hu J, Jia N, Yu D, Zhang G. Improved protein relative solvent accessibility prediction using deep multi-view feature learning framework. Analytical Biochemistry. 2021;631:114358. doi:10.1016/j.ab.2021.114358. PMID:34478704.

PMID: 34478704
Funding: - National Natural Science Foundation of China: 61772273, 61773346, 61902352, 62072243

Links