RADAR (EBI)

RADAR identifies gapped approximate repeats and complex repeat architectures within protein sequences to characterize internal duplications and repeat unit organization.


Key Features:

  • Automatic Segmentation: Implements a three-step algorithmic approach to segment query protein sequences into repeat units without prior assumptions about repeat number or length.
  • Repeat Length Determination: Utilizes spacing between suboptimal self-alignment traces to ascertain repeat lengths.
  • Optimization of Repeat Borders: Adjusts repeat borders to yield a maximal integer number of repeats, refining the segmentation boundaries.
  • Validation of Distant Repeats: Employs iterative profile alignment to validate and detect distant or divergent repeats.
  • Comprehensive Detection: Detects short composition-biased repeats, gapped approximate repeats, and complex repeat architectures involving multiple repeat types.
  • Autonomous Operation: Requires no manual intervention or prior knowledge of repeat structures.
  • Coverage and Accuracy Assessment: Comparison with the Pfam-A database shows good coverage, accurate alignments, and reasonable repeat borders.
  • Novel Repeat Discovery: Screening against databases such as Swissprot revealed approximately 3,000 repeats not previously annotated in existing domain databases.
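
The repeat length determination step can be illustrated with a minimal sketch. It is not RADAR's implementation: it replaces suboptimal self-alignment traces with ungapped diagonals of a self-comparison, and the function names and the `min_identity` threshold are hypothetical choices made for illustration. The idea it shows is the same, though: strongly matching offsets recur at multiples of the repeat length, so the most common spacing between them estimates that length.

```python
from collections import Counter


def strong_offsets(seq, min_identity=0.5):
    """Return offsets at which seq matches a shifted copy of itself.

    Each offset corresponds to one ungapped diagonal of the self-comparison.
    min_identity is an illustrative threshold, not a RADAR parameter.
    """
    n = len(seq)
    strong = []
    for offset in range(1, n // 2 + 1):
        pairs = n - offset
        matches = sum(seq[i] == seq[i + offset] for i in range(pairs))
        if matches / pairs >= min_identity:
            strong.append(offset)
    return strong


def estimate_repeat_length(seq, min_identity=0.5):
    """Estimate repeat length as the most common spacing between strong offsets."""
    offsets = strong_offsets(seq, min_identity)
    if not offsets:
        return None  # no internal repetition detected at this threshold
    # Spacing between consecutive strong diagonals, counting from offset 0.
    spacings = [b - a for a, b in zip([0] + offsets, offsets)]
    return Counter(spacings).most_common(1)[0][0]


print(estimate_repeat_length("ACDEFGH" * 4))  # 7
```

Because the sketch uses ungapped diagonals, it would miss the gapped approximate repeats that RADAR is designed to handle; a gapped self-alignment would be needed for those.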

Scientific Applications:

  • Protein evolution analysis: Enables analysis of internal duplication and repeat-driven evolutionary processes in proteins.
  • Functional and structural unit identification: Facilitates identification of repeat-derived functional and structural units within large proteins and aids annotation of novel repeats.

Methodology:

The method analyzes self-alignment traces to determine repeat lengths and uses iterative profile alignment to refine and validate repeats. Results were compared against Pfam-A for validation, and Swissprot was screened for repeat discovery.
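
The iterative profile step can be sketched as follows. This is a simplified, ungapped stand-in rather than RADAR's actual procedure: the function names, the frequency-based score, and the `threshold` value are assumptions made for illustration. Starting from a few seed repeat units, it builds a per-column residue frequency profile, scans the sequence for further non-overlapping windows that score above the threshold, adds them, and rebuilds the profile until no new units are found.

```python
from collections import Counter


def build_profile(units):
    """Per-column residue frequencies for equal-length repeat units."""
    length = len(units[0])
    return [
        {res: count / len(units)
         for res, count in Counter(u[col] for u in units).items()}
        for col in range(length)
    ]


def score(window, profile):
    """Mean profile frequency of the window's residues, between 0 and 1."""
    return sum(col.get(res, 0.0) for res, col in zip(window, profile)) / len(profile)


def iterate_profile(seq, seed_starts, length, threshold=0.6, max_rounds=10):
    """Grow a set of repeat start positions by repeated profile scanning.

    threshold and max_rounds are illustrative values, not RADAR parameters.
    Windows are accepted greedily from left to right; a fuller version would
    pick the best-scoring window and allow gaps.
    """
    starts = set(seed_starts)
    for _ in range(max_rounds):
        profile = build_profile([seq[s:s + length] for s in sorted(starts)])
        hits = set(starts)
        for s in range(len(seq) - length + 1):
            # Skip windows that overlap a unit already accepted.
            if any(abs(s - h) < length for h in hits):
                continue
            if score(seq[s:s + length], profile) >= threshold:
                hits.add(s)
        if hits == starts:
            break  # converged: no new units in this round
        starts = hits
    return sorted(starts)


# Two exact seed units, one unit with a substitution, a spacer, and a divergent unit.
example = "ACDEFGH" + "ACDEFGH" + "ACNEFGH" + "PPPP" + "ACDQFGW"
print(iterate_profile(example, [0, 7], 7))
```

In this toy input the divergent final unit shares only five of seven residues with the seeds, yet it is still recovered because the profile score tolerates partial matches.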

Details

Tool Type: API, web application
Operating Systems: Linux, Windows, Mac
Added: 1/29/2015
Last Updated: 11/24/2024

Publications

Heger A, Holm L. Rapid automatic detection and alignment of repeats in protein sequences. Proteins: Structure, Function, and Genetics. 2000;41(2):224-237. doi:10.1002/1097-0134(20001101)41:2<224::aid-prot70>3.0.co;2-z. PMID:10966575.

Madeira F, Madhusoodanan N, Lee J, Eusebi A, Niewielska A, Tivey ARN, Lopez R, Butcher S. The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Research. 2024;52(W1):W521-W525. doi:10.1093/nar/gkae241. PMID:38597606. PMCID:PMC11223882.

Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Research. 2022;50(W1):W276-W279. doi:10.1093/nar/gkac240. PMID:35412617. PMCID:PMC9252731.

Funding: EMBL-EBI (824087); BY-COVID (101046203); EarlyCause (848158)
