miRNAss
miRNAss predicts pre-miRNA hairpins from stem-loop sequences using a semi-supervised learning approach to improve genome-wide microRNA discovery.
Key Features:
- Semi-Supervised Learning Approach: miRNAss leverages both labeled and unlabeled stem-loop sequences to improve prediction accuracy when known pre-miRNA examples are limited.
- Handling of Negative Examples: The method includes an automatic procedure for identifying negative examples to differentiate miRNA hairpins from non-miRNA sequences without manual curation.
- Class Imbalance Management: miRNAss is designed to address extreme class imbalance in genomic datasets, where pre-miRNAs can be outnumbered by non-pre-miRNA sequences by approximately 1:10,000.
- Efficiency and Speed: The approach achieves higher prediction rates and reduced execution times compared to existing supervised methods, enabling large-scale analyses.
Scientific Applications:
- Genome-wide pre-miRNA prediction and validation: Validated on genome-wide datasets from three model species, each containing over one million hairpin sequences, for microRNA discovery and characterization.
Methodology:
Semi-supervised learning that integrates information from unlabeled stem-loop sequences to refine model predictions; the method includes an automatic search for negative examples and provisions to handle severe class imbalance (~1:10,000).
Topics
Collections
Details
- Tool Type:
- library
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- R
- Added:
- 6/20/2018
- Last Updated:
- 11/25/2024
Operations
Publications
Yones C, Stegmayer G, Milone DH. Genome-wide pre-miRNA discovery from few labeled examples. Bioinformatics. 2017;34(4):541-549. doi:10.1093/bioinformatics/btx612. PMID:29028911.
PMID: 29028911
Funding: - Consejo Nacional de Investigaciones Científicas y Técnicas: PIP 2013 117
- Universidad Nacional del Litoral: 2011 548, 2016 082
- Agencia Nacional de Promoción Científica y Tecnológica: PICT 2014 2627