InterProScan (EBI)
InterProScan (EBI) predicts protein function by comparing protein sequences to the InterPro protein signature databases for large-scale sequence characterization.
Key Features:
- Sequence-based function prediction: Matches protein sequences against the InterPro protein signature databases to infer protein function.
- Java-based architecture: Implements a reengineered Java-based architecture that improves flexibility and stability.
- Scalable distributed analysis: Utilizes multiprocessor machines and conventional clusters to perform scalable distributed data analysis.
- High-throughput processing: Engineered to handle analyses at the scale of millions of sequences.
- EMBL-EBI integration: Integrated into EMBL-EBI search and sequence analysis tools frameworks to connect with broader data resources and analytical tools.
- Published references: Major version and framework details are described in Publication 1 (PMID: 24451626) and Publication 2 (PMID: 35412617).
Scientific Applications:
- Protein annotation: Automated annotation of protein function across genomic and proteomic datasets.
- Large-scale sequence characterization: Characterization of large proteome collections and high-throughput sequencing outputs.
- Rapid-response analyses: Support for scalable analyses of large datasets arising in rapid-response contexts, including the COVID-19 pandemic.
Methodology:
Sequence analysis is performed against InterPro protein signature databases by a Java-based implementation that exploits multiprocessor machines and conventional clusters for scalable distributed processing capable of handling millions of sequences.
Topics
Collections
Details
- Tool Type:
- api, web application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 11/4/2022
- Last Updated:
- 11/24/2024
Operations
Publications
Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Research. 2022;50(W1):W276-W279. doi:10.1093/nar/gkac240. PMID:35412617. PMCID:PMC9252731.
Jones P, Binns D, Chang H, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn AF, Sangrador-Vegas A, Scheremetjew M, Yong S, Lopez R, Hunter S. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30(9):1236-1240. doi:10.1093/bioinformatics/btu031. PMID:24451626. PMCID:PMC3998142.
Documentation
Downloads
- Downloads pagehttps://www.ebi.ac.uk/interpro/download/