QOMA

QOMA maximizes the Sum-of-Pairs (SP) score for multiple alignment of protein sequences using a graph-based local optimization strategy.


Key Features:

  • Objective: Maximizes the Sum-of-Pairs (SP) score to assess and improve multiple protein sequence alignments.
  • Input: Operates on protein sequences starting from a preliminary multiple alignment.
  • Graph Representation: Represents the alignment as a K-partite graph to enable structured optimization.
  • Local Optimization: Applies a sliding window approach for targeted local optimizations that iteratively adjust the alignment.
  • Adjustable Parameters: Exposes Window Size and Graph Sparsity parameters to balance computational time and optimization scope.
  • Order Independence: Does not depend on the order in which sequences are aligned, unlike progressive alignment methods.
  • Benchmark Performance: Reported to achieve higher SP scores on BAliBASE compared with ClustalW, Probcons, Muscle, T-Coffee, and DCA, with larger improvements for distant proteins.

Scientific Applications:

  • Phylogenetic analysis: Produces high-quality alignments suitable for evolutionary tree inference.
  • Conserved motif identification: Improves detection of conserved motifs across multiple protein sequences.
  • Evolutionary studies: Supports analyses of sequence divergence and conservation among distant proteins.
  • Functional annotation: Aids transfer of functional information via improved alignments.
  • Structural biology investigations: Provides alignments that can inform comparative modeling and structure-based analyses.

Methodology:

Generate a preliminary alignment, represent the alignment with a K-partite graph, perform sliding-window local optimizations on the graph with iterative adjustments to enhance the SP score, and control behavior via Window Size and Graph Sparsity parameters.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux, Windows, Mac
Programming Languages:
C
Added:
12/18/2017
Last Updated:
2/26/2019

Operations

Data Inputs & Outputs

Multiple sequence alignment

Other operations do not define inputs or outputs.

Publications

Zhang X, Kahveci T. QOMA: quasi-optimal multiple alignment of protein sequences. Bioinformatics. 2006;23(2):162-168. doi:10.1093/bioinformatics/btl590.

Links