ORMAN

ORMAN assigns multimapping RNA-Seq reads to specific transcripts to resolve mapping ambiguity and improve transcript quantification and detection of novel transcriptomic events.


Key Features:

  • Resolution of Mapping Ambiguity: Assigns each multimapping read to a specific transcript using an estimated distribution of the genomic region covering the read so reads contribute accurately to transcript quantification.
  • Combinatorial Optimization Approach: Formulates a combinatorial optimization problem to compute the minimum number of potential transcript products per gene and solves it using approximation algorithms, integer linear programs, and heuristics.
  • Enhancement of Transcriptomic Analysis: Improves performance of transcript discovery and quantification methods such as Cufflinks, IsoLasso, and CLIIQ by providing more accurate read assignments for novel transcripts.
  • Validation on Simulated and Real Data: Validated on simulated RNA-Seq datasets (including random subsets of transcripts from the UCSC database) and on real RNA-Seq reads, producing coverage values that closely resemble original distributions, including genes with highly non-uniform coverage.

Scientific Applications:

  • Alternative Splicing Analysis: Enables detection and quantification of novel splicing events and indels by resolving ambiguous read mappings in RNA-Seq data.
  • Gene Fusion and Transcript Variation Detection: Supports identification of gene fusions and other transcript variations by improving assignment of multimapping reads.
  • Transcript Discovery and Quantification: Enhances algorithms for transcript reconstruction and expression quantification from RNA-Seq input.
  • Functional Genomics: Provides more precise transcript-level coverage estimates to support basic biological discovery and functional genomics studies.

Methodology:

Assigns multimapping reads based on an estimated distribution of the region covering each read and formulates a combinatorial optimization problem to compute a minimum set of transcript products per gene, solved using approximation algorithms, integer linear programming, and heuristics.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux
Added:
8/3/2017
Last Updated:
11/25/2024

Operations

Publications

Dao P, Numanagić I, Lin Y, Hach F, Karakoc E, Donmez N, Collins C, Eichler EE, Sahinalp SC. ORMAN: Optimal resolution of ambiguous RNA-Seq multimappings in the presence of novel isoforms. Bioinformatics. 2013;30(5):644-651. doi:10.1093/bioinformatics/btt591. PMID:24130305.

Documentation

Links