MUSCLE (EBI)

MUSCLE (EBI) performs multiple sequence alignment using the MUSCLE (Multiple Sequence Comparison by Log-Expectation) algorithm to produce accurate alignments for evolutionary, structural, and functional analyses.


Key Features:

  • Algorithmic approach: Uses k-mer counting for fast distance estimation, progressive alignment optimized with the log-expectation score, and refinement via tree-dependent restricted partitioning.
  • Benchmarking and performance: Demonstrated superior or joint-highest accuracy versus T-Coffee, MAFFT, and CLUSTALW on BAliBASE, SABmark, SMART, and PREFAB, and can align large datasets (e.g., 5000 sequences of average length 350 in ~7 minutes on a desktop).
  • EMBL-EBI integration: Implemented within the EMBL-EBI Job Dispatcher framework and interoperates with UniProt, InterPro, ENA, and Ensembl Genomes.

Scientific Applications:

  • Evolutionary analysis: Generation of multi-sequence alignments for phylogenetic and comparative genomics studies.
  • Protein structure prediction: Producing alignments used in homology modeling and secondary/tertiary structure inference.
  • Functional annotation: Alignments to support annotation of conserved motifs, domains, and gene function.
  • Large-scale genomic analyses: Handling high-throughput alignment tasks for extensive sequence datasets.

Methodology:

Computational steps include fast distance estimation via k-mer counting, progressive alignment using the log-expectation score, and refinement by tree-dependent restricted partitioning.

Topics

Collections

Details

Tool Type:
api, web application
Operating Systems:
Linux, Windows, Mac
Added:
1/29/2015
Last Updated:
11/24/2024

Operations

Publications

Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32(5):1792-1797. doi:10.1093/nar/gkh340. PMID:15034147. PMCID:PMC390337.

Madeira F, Madhusoodanan N, Lee J, Eusebi A, Niewielska A, Tivey ARN, Lopez R, Butcher S. The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Research. 2024;52(W1):W521-W525. doi:10.1093/nar/gkae241. PMID:38597606. PMCID:PMC11223882.

Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Research. 2022;50(W1):W276-W279. doi:10.1093/nar/gkac240. PMID:35412617. PMCID:PMC9252731.

PMID: 35412617
PMCID: PMC9252731
Funding: - EMBL-EBI: 824087 - BY-COVID: 101046203 - EarlyCause: 848158

Documentation

Downloads

Links

Related Tools

muscle
Relation: uses