InterProScan (EBI)

InterProScan (EBI) scans protein sequences against the InterPro protein signature databases to predict protein domains, families, and functional annotations for large-scale genomic and proteomic analyses.


Key Features:

  • Protein signature scanning: Scans protein sequences against the InterPro protein signature databases to identify domains, families, and signatures.
  • Java-based architecture: Implements a Java-based architecture that supports execution across multiprocessor machines and conventional clusters.
  • Scalable distributed data analysis: Performs distributed data analysis to enable scalable processing of large sequence datasets on multiprocessor and cluster environments.
  • Framework reimplementation and output enhancements: Includes a reimplemented software framework and enhanced outputs for more detailed and comprehensive annotation results.
  • Integration with EMBL-EBI services: Integrates with EMBL-EBI resources including EBI Search and the Job Dispatcher framework and supports access via RESTful and SOAP APIs.

Scientific Applications:

  • Protein function prediction: Predicts protein function by identifying InterPro signatures, domains, and families within sequences.
  • Large-scale genomic and proteomic annotation: Enables characterization and annotation of large datasets in genomic and proteomic studies, including projects involving millions of sequences.
  • Biological process and disease research: Supports studies of protein roles in biological processes and diseases through domain and signature annotation.

Methodology:

Scans protein sequences against the InterPro protein signature databases using a Java-based distributed analysis framework that runs on multiprocessor machines and clusters and integrates with EMBL-EBI EBI Search and the Job Dispatcher framework via RESTful and SOAP APIs.

Topics

Collections

Details

Tool Type:
api, web application
Operating Systems:
Linux, Windows, Mac
Added:
1/29/2015
Last Updated:
11/24/2024

Operations

Publications

Jones P, Binns D, Chang H, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn AF, Sangrador-Vegas A, Scheremetjew M, Yong S, Lopez R, Hunter S. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30(9):1236-1240. doi:10.1093/bioinformatics/btu031. PMID:24451626. PMCID:PMC3998142.

Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Research. 2022;50(W1):W276-W279. doi:10.1093/nar/gkac240. PMID:35412617. PMCID:PMC9252731.

PMID: 35412617
PMCID: PMC9252731
Funding: - EMBL-EBI: 824087 - BY-COVID: 101046203 - EarlyCause: 848158

Documentation

Downloads

Links

Related Tools

interpro
Relation: uses