TopHat-Fusion

TopHat-Fusion identifies fusion transcripts from RNA sequencing (RNA-seq) data by detecting fusion junctions and novel splice variants arising from chromosomal rearrangements, enabling discovery of gene fusions relevant to disease such as cancer.


Key Features:

  • RNA-seq input: Processes RNA sequencing (RNA-seq) data to interrogate the transcriptome, including coding and non-coding regions.
  • Annotation-independent alignment: Aligns reads without relying on pre-existing gene annotations, enabling detection from unannotated genomic regions.
  • Fusion junction detection: Aligns reads across potential fusion points to identify junctions indicative of gene fusions.
  • Novel splice variant discovery: Detects novel splice variants of known genes as well as fusion transcripts originating from previously unannotated regions.
  • Scalability for high-throughput data: Employs a robust algorithmic framework that handles large RNA-seq datasets typical of high-throughput sequencing.
  • TopHat lineage: Extends the TopHat alignment approach to enable fusion detection.
  • Known and novel fusion recovery: Recovers previously reported fusion genes and discovers novel fusions with supporting read evidence.

Scientific Applications:

  • Cancer fusion discovery: Detection of gene fusions in cancer studies, with example applications in breast and prostate cancer cell lines.
  • Transcriptome-wide fusion screening: Annotation-independent, transcriptome-wide identification of fusion transcripts and novel splice variants.
  • Validation of reported fusions: Confirmation and validation of previously reported fusion events using RNA-seq evidence.

Methodology:

Analyzes RNA-seq reads by performing annotation-independent alignment and mapping reads across candidate fusion junctions to identify fusion transcripts within a computational framework designed for large high-throughput datasets.

Topics

Details

Maturity:
Mature
Tool Type:
command-line tool
Programming Languages:
Python
Added:
1/13/2017
Last Updated:
11/25/2024

Operations

Publications

Kim D, Salzberg SL. TopHat-Fusion: an algorithm for discovery of novel fusion transcripts. Genome Biology. 2011;12(8). doi:10.1186/gb-2011-12-8-r72. PMID:21835007. PMCID:PMC3245612.

Documentation