UniProtKB
UniProtKB provides a curated knowledgebase of protein sequences and functional annotations to support protein-centric bioinformatics analyses.
Key Features:
- Integrated Data Sources: Synthesizes literature-derived information and data from multiple biological resources to produce standardized protein records.
- Biocuration and Consortium: Expert biocuration is performed by the UniProt Consortium (European Bioinformatics Institute, SIB Swiss Institute of Bioinformatics, Protein Information Resource) producing manually reviewed annotations.
- UniProtKB/Swiss-Prot: Contains manually annotated records with curator-evaluated computational analyses.
- UniProtKB/TrEMBL: Contains computationally analyzed records that are pending manual annotation.
- Extensive Sequence Collection: Contains over 80 million protein sequences reflecting large-scale genomic data growth.
- Accession Number Expansion: Accession space was extended from six to ten characters to accommodate new entries.
- Proteome Identifier and Annotation Score: Provides proteome identifiers for specific assemblies and an annotation score quantifying the relative amount of known information per entry.
- Update Frequency: Curated data are released on a four-week update cycle.
Scientific Applications:
- Functional Genomics: Supports assignment and analysis of protein functions derived from genomic and transcriptomic data.
- Comparative Proteomics: Enables comparison of proteins across species using annotation scores to identify well-characterized entries.
- Bioinformatics Research: Supplies standardized, comprehensive protein data for algorithm development, annotation transfer, and large-scale analyses.
- Genomic Projects and Provenance Tracking: Facilitates tracking of sequence provenance and specific assemblies via unique proteome identifiers.
Methodology:
UniProtKB combines manual expert biocuration (including literature evaluation and data integration by the UniProt Consortium: EBI, SIB, PIR) with computational analyses to generate and update protein annotations.
Topics
Collections
Details
- License:
- CC-BY-ND-3.0
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- web application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 1/21/2015
- Last Updated:
- 11/25/2024
Operations
Data Inputs & Outputs
Gene functional annotation
Inputs
Outputs
Query and retrieval
Inputs
Publications
Unknown Authors. UniProt: a hub for protein information. Nucleic Acids Research. 2014;43(D1):D204-D212. doi:10.1093/nar/gku989. PMID:25348405. PMCID:PMC4384041.
Unknown Authors. Activities at the Universal Protein Resource (UniProt). Nucleic Acids Research. 2013;42(D1):D191-D198. doi:10.1093/nar/gkt1140. PMID:24253303. PMCID:PMC3965022.
Documentation
Citation instructions
http://www.uniprot.org/help/publicationsGeneral
http://www.uniprot.org/helpDownloads
- Binarieshttp://www.uniprot.org/downloads
- Source codehttp://www.uniprot.org/downloads
Links
Repository
http://www.uniprot.org/downloads