RADAR (EBI)
RADAR identifies gapped approximate repeats and complex repeat architectures within protein sequences to characterize internal duplications and repeat unit organization.
Key Features:
- Automatic Segmentation: Implements a three-step algorithmic approach to segment query protein sequences into repeat units without prior assumptions about repeat number or length.
- Repeat Length Determination: Utilizes spacing between suboptimal self-alignment traces to ascertain repeat lengths.
- Optimization of Repeat Borders: Adjusts repeat borders to maximize an integer number of repeats and refine segmentation boundaries.
- Validation of Distant Repeats: Employs iterative profile alignment to validate and detect distant or divergent repeats.
- Comprehensive Detection: Detects short composition-biased repeats, gapped approximate repeats, and complex repeat architectures involving multiple repeat types.
- Autonomous Operation: Operates without manual intervention or prior knowledge of repeat structures, performing automated repeat detection and segmentation.
- Coverage and Accuracy Assessment: Demonstrates good coverage, accurate alignments, and reasonable repeat borders through comparative analysis with the Pfam-A database.
- Novel Repeat Discovery: Screening against databases such as Swissprot revealed approximately 3,000 repeats not previously annotated in existing domain databases.
Scientific Applications:
- Protein evolution analysis: Enables analysis of internal duplication and repeat-driven evolutionary processes in proteins.
- Functional and structural unit identification: Facilitates identification of repeat-derived functional and structural units within large proteins and aids annotation of novel repeats.
Methodology:
Analyzing self-alignment traces to determine repeat lengths; iterative profile alignment to refine and validate repeats; and comparison against databases such as Pfam-A and Swissprot for validation and discovery.
Topics
Collections
Details
- Tool Type:
- api, web application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 1/29/2015
- Last Updated:
- 11/24/2024
Operations
Publications
Heger A, Holm L. Rapid automatic detection and alignment of repeats in protein sequences. Proteins: Structure, Function, and Genetics. 2000;41(2):224-237. doi:10.1002/1097-0134(20001101)41:2<224::aid-prot70>3.0.co;2-z. PMID:10966575.
Madeira F, Madhusoodanan N, Lee J, Eusebi A, Niewielska A, Tivey ARN, Lopez R, Butcher S. The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Research. 2024;52(W1):W521-W525. doi:10.1093/nar/gkae241. PMID:38597606. PMCID:PMC11223882.
Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Research. 2022;50(W1):W276-W279. doi:10.1093/nar/gkac240. PMID:35412617. PMCID:PMC9252731.
Documentation
Downloads
- Downloads pagehttps://sourceforge.net/projects/repeatradar/