TSEBRA
TSEBRA selects transcripts from BRAKER1 and BRAKER2 outputs to produce an evidence-supported set of protein-coding gene models for eukaryotic genomes.
Key Features:
- Integration of Evidence Types: Combines RNA-seq evidence used by BRAKER1 with cross-species protein database evidence used by BRAKER2 to leverage both types of extrinsic support.
- Transcript Selection Mechanism: Applies rule-based comparisons of scores for overlapping transcripts, evaluating support from introns, start/stop codons, RNA-seq alignments, and homologous protein evidence.
- Benchmarking and Accuracy: Demonstrated higher gene prediction accuracy than BRAKER1 or BRAKER2 alone across 11 species and shown competitive performance with combiners such as EVidenceModeler.
Scientific Applications:
- Novel genome annotation: Improves gene annotations in novel eukaryotic genome projects when both RNA-seq and protein evidence are available.
- Genomic research and comparative annotation: Produces more evidence-supported gene models for downstream analyses and comparative studies of eukaryotic genomes.
Methodology:
TSEBRA takes gene predictions generated independently by BRAKER1 and BRAKER2 and applies a transcript selection algorithm that compares scores of overlapping transcripts, prioritizing support from introns, start/stop codons, RNA-seq data, and homologous protein evidence.
Topics
Details
- License:
- Artistic-2.0
- Cost:
- Free of charge
- Tool Type:
- command-line tool
- Operating Systems:
- Mac, Windows, Linux
- Programming Languages:
- Python, Shell
- Added:
- 10/12/2021
- Last Updated:
- 10/12/2021
Operations
Publications
Gabriel L, Hoff KJ, Brůna T, Borodovsky M, Stanke M. TSEBRA: Transcript Selector for BRAKER. Unknown Journal. 2021. doi:10.1101/2021.06.07.447316.
Documentation
Downloads
- Downloads pagehttps://github.com/Gaius-Augustus/TSEBRA