InterProScan (EBI)

InterProScan (EBI) predicts protein function by comparing protein sequences to the InterPro protein signature databases for large-scale sequence characterization.


Key Features:

  • Sequence-based function prediction: Matches protein sequences against the InterPro protein signature databases to infer protein function.
  • Java-based architecture: Implements a reengineered Java-based architecture that improves flexibility and stability.
  • Scalable distributed analysis: Utilizes multiprocessor machines and conventional clusters to perform scalable distributed data analysis.
  • High-throughput processing: Engineered to handle analyses at the scale of millions of sequences.
  • EMBL-EBI integration: Integrated into EMBL-EBI search and sequence analysis tools frameworks to connect with broader data resources and analytical tools.
  • Published references: Major version and framework details are described in Publication 1 (PMID: 24451626) and Publication 2 (PMID: 35412617).

Scientific Applications:

  • Protein annotation: Automated annotation of protein function across genomic and proteomic datasets.
  • Large-scale sequence characterization: Characterization of large proteome collections and high-throughput sequencing outputs.
  • Rapid-response analyses: Support for scalable analyses of large datasets arising in rapid-response contexts, including the COVID-19 pandemic.

Methodology:

Sequence analysis is performed against InterPro protein signature databases by a Java-based implementation that exploits multiprocessor machines and conventional clusters for scalable distributed processing capable of handling millions of sequences.

Topics

Collections

Details

Tool Type:
api, web application
Operating Systems:
Linux, Windows, Mac
Added:
11/4/2022
Last Updated:
11/24/2024

Operations

Publications

Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Research. 2022;50(W1):W276-W279. doi:10.1093/nar/gkac240. PMID:35412617. PMCID:PMC9252731.

PMID: 35412617
PMCID: PMC9252731
Funding: - EMBL-EBI: 824087 - BY-COVID: 101046203 - EarlyCause: 848158

Jones P, Binns D, Chang H, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn AF, Sangrador-Vegas A, Scheremetjew M, Yong S, Lopez R, Hunter S. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30(9):1236-1240. doi:10.1093/bioinformatics/btu031. PMID:24451626. PMCID:PMC3998142.

Documentation

Downloads

Links

Related Tools

interpro
Relation: uses