Clustal 2 (Clustal W, Clustal X)

Clustal 2 (Clustal W, Clustal X) performs multiple sequence alignment of nucleotide and protein sequences to support comparative genomics, evolutionary analysis, and protein structure and function inference.


Key Features:

  • Improved sensitivity for divergent sequences: Uses individual sequence weighting in partial alignments, dynamic adjustment of amino acid substitution matrices according to sequence divergence at different alignment stages, and residue-specific plus locally reduced gap penalties in hydrophilic regions to favor gap placement in putative loop areas.
  • Progressive multiple sequence alignment: Implements a progressive alignment strategy for constructing MSAs from pairwise relationships and profiles.
  • Profile alignment capability: Supports alignment of multiple sequences and sequence profiles for combined or iterative alignment tasks.
  • Alignment quality analysis: Provides analysis to identify low-scoring segments or exceptional residues to guide iterative refinement and detect input-sequence errors.

Scientific Applications:

  • Conserved motif identification: Aligns nucleotide and protein sequences to detect conserved motifs and sequence signatures.
  • Evolutionary analysis: Facilitates inference of evolutionary relationships among homologous sequences, including distant homologs.
  • Functional domain and structure inference: Supports identification of functional domains and contributes to protein structure and function prediction.
  • Study of divergent sequences: Enables analysis of distant homologs and complex evolutionary scenarios by improving alignment sensitivity for highly divergent sequences.

Methodology:

Employs a progressive alignment strategy enhanced by sequence weighting in partial alignments, dynamic/adaptive amino acid substitution matrices adjusted by sequence divergence at different stages, and residue-specific plus locally reduced gap-penalty schemes in hydrophilic regions.

Topics

Collections

Details

License:
LGPL-3.0
Maturity:
Mature
Cost:
Free of charge
Tool Type:
command-line tool, desktop application
Operating Systems:
Linux, Windows, Mac
Added:
4/21/2017
Last Updated:
11/24/2024

Operations

Data Inputs & Outputs

Multiple sequence alignment

Publications

Larkin M, Blackshields G, Brown N, Chenna R, McGettigan P, McWilliam H, Valentin F, Wallace I, Wilm A, Lopez R, Thompson J, Gibson T, Higgins D. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23(21):2947-2948. doi:10.1093/bioinformatics/btm404. PMID:17846036.

Higgins DG, Sharp PM. CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. Gene. 1988;73(1):237-244. doi:10.1016/0378-1119(88)90330-7.

Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research. 1994;22(22):4673-4680. doi:10.1093/nar/22.22.4673. PMID:7984417. PMCID:PMC308517.

Thompson J. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research. 1997;25(24):4876-4882. doi:10.1093/nar/25.24.4876. PMID:9396791. PMCID:PMC147148.

Documentation

Downloads

Links

Related Tools

clustalw
Relation: isNewVersionOf