RNAprofile

RNAprofile identifies conserved sequence-and-structure motifs within sets of unaligned RNA sequences to detect functional elements in mRNA and non-coding RNA.


Key Features:

  • Motif Identification: Extracts regions of highest conservation from collections of unaligned RNA sequences using a similarity measure that incorporates nucleotide sequence and secondary-structure potential.
  • Secondary Structure Consideration: Integrates base-pairing and thermodynamic principles to predict optimal secondary structures of candidate motifs, enabling detection of conserved elements such as hairpins.
  • Parameterization: Requires a single input parameter specifying the number of distinct hairpins the motif must contain, without constraining the size, number, or position of individual motif elements.
  • Efficient Search Strategy: Employs a greedy heuristic to reduce computational complexity and avoid exhaustive enumeration of candidate regions.
  • Robustness to Data Variability: Operates when motifs are absent from some input sequences and can handle corrupted datasets while identifying more complex motifs in non-coding RNA.

Scientific Applications:

  • Iron responsive element (IRE): Characterization of IRE stem-loop regulatory motifs in mRNA.
  • Selenocysteine insertion sequence (SECIS): Detection and characterization of SECIS stem-loop structures.
  • Functional element exploration: Investigation of conserved sequence-and-structure functional elements across mRNA and non-coding RNA.

Methodology:

Extract candidate regions from each input sequence whose predicted secondary structure contains the specified number of hairpins; compare extracted regions with a similarity measure combining nucleotide sequence and secondary-structure potential to identify groups of most similar regions including one region per sequence; employ a greedy heuristic to limit search and avoid exhaustive enumeration.

Topics

Collections

Details

License:
GPL-3.0
Maturity:
Mature
Tool Type:
command-line tool
Operating Systems:
Linux, Mac
Programming Languages:
C
Added:
1/22/2015
Last Updated:
11/25/2024

Operations

Publications

Pavesi G. RNAProfile: an algorithm for finding conserved secondary structure motifs in unaligned RNA sequences. Nucleic Acids Research. 2004;32(10):3258-3269. doi:10.1093/nar/gkh650. PMID:15199174. PMCID:PMC434454.

Documentation