DescribePROT

DescribePROT integrates 10 predictive algorithms to provide amino acid-level predictions of protein structure and function across 83 complete proteomes.


Key Features:

  • Integration of predictive algorithms: Combines outputs from ten distinct prediction algorithms to produce residue-level annotations.
  • Descriptor set: Provides 13 residue-level descriptors, including sequence conservation, position-specific scoring matrices (PSSMs), secondary structure, solvent accessibility, intrinsic disorder, disordered linkers, signal peptides, motifs for receptor flexibility (MoRFs), and predictions of protein–protein, protein–DNA, and protein–RNA interactions.
  • Proteome coverage: Annotations span 83 complete proteomes, including key model organisms.
  • Scale of data: Contains approximately 1.4 million proteins covering nearly 600 million amino acids.
  • Pre-computed predictions: Comprises about 7.8 billion pre-computed residue-level predictions.

Scientific Applications:

  • Protein function analysis: Supports residue-level investigations into protein structure–function relationships.
  • Therapeutics and disease research: Provides annotation data applicable to projects targeting therapeutics and disease mechanisms.
  • Predictor development and benchmarking: Serves as a resource for developing and benchmarking novel predictors of protein sequence descriptors.
  • Comparative proteomics: Enables comparative analyses across complete proteomes and model organisms.

Methodology:

Outputs were generated by integrating ten prediction algorithms to compute 13 residue-level descriptors across 83 complete proteomes, producing ~7.8 billion pre-computed residue-level predictions covering ~1.4 million proteins (~600 million amino acids).

Topics

Details

Tool Type:
web application
Added:
1/18/2021
Last Updated:
11/24/2024

Operations

Publications

Zhao B, Katuwawala A, Oldfield CJ, Dunker AK, Faraggi E, Gsponer J, Kloczkowski A, Malhis N, Mirdita M, Obradovic Z, Söding J, Steinegger M, Zhou Y, Kurgan L. DescribePROT: database of amino acid-level protein structure and function predictions. Nucleic Acids Research. 2020;49(D1):D298-D308. doi:10.1093/nar/gkaa931. PMID:33119734. PMCID:PMC7778963.

PMID: 33119734
PMCID: PMC7778963
Funding: - National Science Foundation: 1617369, 1661391 - National Institutes of Health: R01 GM127701