supernmotifs

supernmotifs performs alignment-free comparison of RNA secondary structures to identify and represent structural motifs for scalable analysis of RNA structural patterns.


Key Features:

  • Alignment-Free Approach: Employs an alignment-free method to compare RNA secondary structures without sequence alignment.
  • Super-n-Motifs Model: Uses a super-n-motifs model with latent analysis to identify enhanced motifs and capture adjacency relations within structures.
  • Vector Representation: Computes vector representations of RNA secondary structures as linear combinations of identified motifs.
  • Structural Scope: Analyzes both linear and circular RNAs and accommodates complex elements such as pseudoknots and G-quadruplex (G4) motifs.
  • Scalability and Efficiency: Performs large-scale comparisons of up to 10,000 secondary structures with reported efficiency gains of up to four orders of magnitude.
  • Comparative Representation Advantage: Provides improved representation compared to ordered tree, arc-annotated, and string representations for capturing critical RNA structural features.
  • Implementation: Implemented in C++ for computational performance.
  • High-Throughput Data Compatibility: Applicable to datasets generated by high-throughput probing techniques.

Scientific Applications:

  • Structural Motif Discovery: Identification of recurring and enhanced structural motifs within RNA secondary structures.
  • Comparative Structural Analysis: Large-scale comparison and clustering of RNA secondary structures across diverse datasets.
  • RNA Function Studies: Investigation of structural patterns relevant to gene regulation and RNA-mediated biological processes.
  • Interaction and Therapeutic Research: Analysis supporting studies of RNA-protein interactions and development of RNA-based therapeutics.

Methodology:

Uses an alignment-free computational approach implementing a super-n-motifs model with latent analysis to identify motifs and adjacency relations, computes vector representations as linear combinations of motifs, supports linear and circular RNAs including pseudoknots and G-quadruplex (G4) motifs, and reports large-scale comparison capability (up to 10,000 structures) with up to four orders of magnitude efficiency improvement; implemented in C++.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
C++
Added:
6/4/2018
Last Updated:
11/25/2024

Operations

Publications

Glouzon JS, Perreault J, Wang S. The super-n-motifs model: a novel alignment-free approach for representing and comparing RNA secondary structures. Bioinformatics. 2017;33(8):1169-1178. doi:10.1093/bioinformatics/btw773. PMID:28088762.

Documentation