Clustal 2 (Clustal W, Clustal X)
Clustal 2 (Clustal W, Clustal X) performs multiple sequence alignment of nucleotide and protein sequences to support comparative genomics, evolutionary analysis, and protein structure and function inference.
Key Features:
- Improved sensitivity for divergent sequences: Uses individual sequence weighting in partial alignments, dynamic adjustment of amino acid substitution matrices according to sequence divergence at different alignment stages, and residue-specific plus locally reduced gap penalties in hydrophilic regions to favor gap placement in putative loop areas.
- Progressive multiple sequence alignment: Implements a progressive alignment strategy for constructing MSAs from pairwise relationships and profiles.
- Profile alignment capability: Supports alignment of multiple sequences and sequence profiles for combined or iterative alignment tasks.
- Alignment quality analysis: Provides analysis to identify low-scoring segments or exceptional residues to guide iterative refinement and detect input-sequence errors.
Scientific Applications:
- Conserved motif identification: Aligns nucleotide and protein sequences to detect conserved motifs and sequence signatures.
- Evolutionary analysis: Facilitates inference of evolutionary relationships among homologous sequences, including distant homologs.
- Functional domain and structure inference: Supports identification of functional domains and contributes to protein structure and function prediction.
- Study of divergent sequences: Enables analysis of distant homologs and complex evolutionary scenarios by improving alignment sensitivity for highly divergent sequences.
Methodology:
Employs a progressive alignment strategy enhanced by sequence weighting in partial alignments, dynamic/adaptive amino acid substitution matrices adjusted by sequence divergence at different stages, and residue-specific plus locally reduced gap-penalty schemes in hydrophilic regions.
Topics
Collections
Details
- License:
- LGPL-3.0
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- command-line tool, desktop application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 4/21/2017
- Last Updated:
- 11/24/2024
Operations
Data Inputs & Outputs
Multiple sequence alignment
Inputs
Outputs
Publications
Larkin M, Blackshields G, Brown N, Chenna R, McGettigan P, McWilliam H, Valentin F, Wallace I, Wilm A, Lopez R, Thompson J, Gibson T, Higgins D. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23(21):2947-2948. doi:10.1093/bioinformatics/btm404. PMID:17846036.
Higgins DG, Sharp PM. CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. Gene. 1988;73(1):237-244. doi:10.1016/0378-1119(88)90330-7.
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research. 1994;22(22):4673-4680. doi:10.1093/nar/22.22.4673. PMID:7984417. PMCID:PMC308517.
Thompson J. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research. 1997;25(24):4876-4882. doi:10.1093/nar/25.24.4876. PMID:9396791. PMCID:PMC147148.
Documentation
Downloads
- Downloads pagehttp://www.clustal.org/clustal2/#Download
- Software packagehttps://tracker.debian.org/pkg/clustalwDebian package (Debian Med Packaging Team). Source code package of the command-line interface Clustal W and the corresponding binary package.
- Software packagehttps://tracker.debian.org/pkg/clustalxDebian package (Debian Med Packaging Team). Source code package of the graphical user interface Clustal X and the corresponding binary package.
- Software packagehttps://bioconda.github.io/recipes/clustalw/README.htmlConda package (Bioconda channel) of the command-line interface Clustal W. Conda recipe at https://github.com/bioconda/bioconda-recipes/tree/master/recipes/clustalw/meta.yaml