FRAMA
FRAMA reconstructs and annotates de novo transcriptomes from second-generation RNA sequencing (RNA-seq) data to produce low-redundant, full-length mRNA and coding sequence catalogs for organisms lacking a reference genome, including resolution of paralogs and alternative splice variants.
Key Features:
- Genome-independent annotation: Operates without a reference genome, enabling annotation of organisms with incomplete or unavailable genomic data.
- Reduction of contig redundancy: Reduces redundancy among assembled contigs to produce a low-redundant transcript catalog.
- Ortholog assignment: Assigns orthology to assembled transcripts to facilitate comparative genomics.
- Correction of misassembled transcripts: Detects and corrects misassemblies, including those caused by gene fusion events.
- Scaffolding of fragmented transcripts: Scaffolds fragmented transcripts to extend sequences, increasing median sequence information (e.g., 1.27-fold increase observed in naked mole-rat).
- Coding sequence identification: Identifies coding sequences (CDSs) within assembled transcripts to annotate full-length CDSs.
Scientific Applications:
- Transcriptome assembly and annotation: Produces de novo transcriptome assemblies and annotations, as demonstrated for the naked mole-rat transcriptome.
- Gene expression studies: Generates high-quality gene models that support detailed gene expression analyses.
- Comparative transcriptomics: Supports comparative studies across species or conditions via orthology assignment and redundancy reduction.
Methodology:
Builds upon de novo assemblies generated by tools such as Trinity and applies post-assembly refinement tasks—including correction of misassemblies, scaffolding of transcripts, orthology assignment, redundancy reduction, and coding sequence identification—and was validated by annotating 21,984 mRNAs corresponding to 16,887 genes and correcting misassemblies in 4,774 genes in the naked mole-rat study.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Mac
- Programming Languages:
- R, Perl
- Added:
- 5/18/2018
- Last Updated:
- 12/10/2018
Operations
Publications
Bens M, Sahm A, Groth M, Jahn N, Morhart M, Holtze S, Hildebrandt TB, Platzer M, Szafranski K. FRAMA: from RNA-seq data to annotated mRNA assemblies. BMC Genomics. 2016;17(1). doi:10.1186/s12864-015-2349-8. PMID:26763976. PMCID:PMC4712544.