new uORFdb
new uORFdb catalogs upstream open reading frames (uORFs) initiated by AUG and near-cognate start codons within transcript leader sequences of eukaryotic transcripts and integrates sequence, literature, and variation data to support analysis of their roles in regulating downstream translation and encoding non-canonical peptides.
Key Features:
- Comprehensive Data Integration: Integrates literature, sequence data, and variation information into a centralized resource for uORF research.
- Extensive Sequence Database: Contains detailed sequence information for over 2.4 million human uORFs and more than 4.2 million uORFs across 12 additional species.
- Transcript- and Reading-Frame-Specific Models: Provides graphical displays and models that represent multiple uORFs within their transcript and reading-frame contexts.
- Advanced Filtering and Analysis Tools: Implements filters and sequence-related information retrieval for targeted analyses and links entries to UCSC Genome Browser, dbSNP, and ClinVar.
- Genetic Variation Data: Includes uORF-related somatic variation data derived from whole-genome sequencing (WGS) of 677 cancer samples collected by the TCGA consortium.
Scientific Applications:
- Translational Regulation Studies: Enables analysis of uORF-mediated regulation of downstream main protein coding sequence translation.
- Non-Canonical Peptide Discovery: Supports identification and characterization of peptides encoded by uORFs.
- Disease-Associated Variant Interpretation: Facilitates investigation of genetic variations in uORFs linked to diseases, including cancer.
- Cancer Genomics and Somatic Variant Analysis: Provides somatic WGS-derived uORF variation data for cancer-related studies using TCGA samples.
- Comparative and Evolutionary Studies: Offers cross-species uORF datasets to support evolutionary and functional comparisons.
- Personalized Medicine Research: Supplies integrated variation and sequence data applicable to patient-specific variant interpretation in clinical research contexts.
Methodology:
Integrates literature, sequence, and variation data; generates graphical transcript- and reading-frame-specific models; applies filters for sequence retrieval and analysis; incorporates somatic variants from WGS of 677 TCGA cancer samples; and links entries to UCSC Genome Browser, dbSNP, and ClinVar.
Topics
Details
- Cost:
- Free of charge
- Tool Type:
- web application
- Operating Systems:
- Mac, Linux, Windows
- Programming Languages:
- Perl, Python, Shell
- Added:
- 12/22/2022
- Last Updated:
- 11/24/2024
Operations
Data Inputs & Outputs
Database search
Publications
Manske F, Ogoniak L, Jürgens L, Grundmann N, Makałowski W, Wethmar K. The new uORFdb: integrating literature, sequence, and variation data in a central hub for uORF research. Nucleic Acids Research. 2022;51(D1):D328-D336. doi:10.1093/nar/gkac899. PMID:36305828. PMCID:PMC9825577.