sORFs
sORFs catalogs small open reading frames (sORFs) identified by ribosome profiling (RIBO-seq) to enable discovery and proteomic validation of micropeptides and translated sORFs.
Key Features:
- Extensive dataset: Hosts over 263,354 sORFs with ribosome occupancy across six species derived from more than 78 RIBO-seq datasets, including HCT116 (human), E14_mESC (mouse), S2 (Drosophila), zebrafish, rat, and Caenorhabditis elegans.
- Advanced detection pipeline: Detects sORFs in datasets containing solely elongating ribosome profiling data, removing the previous requirement for initiating RIBO-seq data.
- Noise filtering algorithm: Implements a noise filtering algorithm that distinguishes true ribosomal activity from simulated noise to reduce false positives in sORF identification.
- Sequence similarity assessment: Executes an internal BLAST pipeline to evaluate sequence similarity among sORFs for comparative and functional inference.
- Proteomics integration: Integrates a full PRIDE-ReSpin pipeline to reprocess public MS-based proteomics datasets from the PRIDE database for identification of translation events and micropeptide validation.
- Visualization tools: Provides inspection of annotated spectra using the Lorikeet MS/MS viewer for manual review of peptide-spectrum matches.
Scientific Applications:
- Micropeptide discovery: Supports identification and cataloging of micropeptides encoded by sORFs using RIBO-seq evidence.
- Cross-species comparison: Enables comparative sequence analyses and conservation studies among included species using the internal BLAST pipeline.
- Proteomic validation: Facilitates validation of translation events and peptide identification by reprocessing PRIDE MS datasets with PRIDE-ReSpin and inspecting spectra.
- Gene expression and peptide function studies: Aids investigations into gene expression regulation and the functional characterization of peptides encoded by sORFs.
Methodology:
Processes ribosome profiling (RIBO-seq) data using a detection pipeline for elongating RIBO-seq datasets, applies a noise filtering algorithm, performs sequence similarity searches with BLAST, reprocesses PRIDE MS datasets via the PRIDE-ReSpin pipeline, and presents annotated spectra for inspection in the Lorikeet MS/MS viewer.
Topics
Details
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- web application
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- PHP, JavaScript, SQL
- Added:
- 6/24/2019
- Last Updated:
- 11/24/2024
Operations
Publications
Olexiouk V, Crappé J, Verbruggen S, Verhegen K, Martens L, Menschaert G. sORFs.org: a repository of small ORFs identified by ribosome profiling. Nucleic Acids Research. 2015;44(D1):D324-D329. doi:10.1093/nar/gkv1175. PMID:26527729. PMCID:PMC4702841.
Olexiouk V, Van Criekinge W, Menschaert G. An update on sORFs.org: a repository of small ORFs identified by ribosome profiling. Nucleic Acids Research. 2017;46(D1):D497-D502. doi:10.1093/nar/gkx1130. PMID:29140531. PMCID:PMC5753181.