SINA
SINA aligns rRNA gene sequences to SILVA SSU/LSU reference datasets using k-mer searching and partial order alignment (POA) to produce high-quality multiple sequence alignments for taxonomic and phylogenetic analyses.
Key Features:
- High-Throughput Performance: Engineered to process very large numbers of rRNA sequences efficiently for large-scale datasets.
- Alignment Accuracy: Combines k-mer searching and partial order alignment (POA) to maintain high alignment precision on homologous rRNA sequences.
- Benchmark Performance: Reproduced BRAliBase III benchmark MSAs with accuracies of 99.3%, 97.6%, and 96.1%; on a 38,772-sequence benchmark achieved 98.9% and 99.3% accuracy using reference MSAs of 1000 and 5000 sequences, respectively, and demonstrated superior accuracy compared to PyNAST and mothur.
- Reference-Based Alignment: Aligns sequences against the SILVA SSU/LSU Ref datasets and accommodates reference-based alignments (reported support up to 500 sequences).
Scientific Applications:
- Phylogenetic studies: Produces reference-aligned rRNA MSAs suitable for constructing and refining phylogenetic trees.
- Taxonomic classification: Provides alignments compatible with taxonomic assignment workflows using SILVA reference datasets.
- Biodiversity assessments: Enables accurate comparison of rRNA sequences for diversity and community composition analyses.
- Large-scale metagenomic analyses: Supports alignment of extensive rRNA sequence collections typical of metagenomic and amplicon sequencing projects.
Methodology:
Performs k-mer searching combined with partial order alignment (POA) and supports reference-based alignment against SILVA SSU/LSU Ref datasets.
Topics
Collections
Details
- License:
- GPL-3.0
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- web application
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- C++
- Added:
- 8/3/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Pruesse E, Peplies J, Glöckner FO. SINA: Accurate high-throughput multiple sequence alignment of ribosomal RNA genes. Bioinformatics. 2012;28(14):1823-1829. doi:10.1093/bioinformatics/bts252. PMID:22556368. PMCID:PMC3389763.
Documentation
Downloads
- Source codehttps://github.com/epruesse/SINA/releases