uQlust

uQlust is a fast and efficient software tool for ranking and clustering large sets of protein and RNA structures. It utilizes structural profiles and combines a linear-time algorithm with profile hashing to enable rapid comparison and clustering of macromolecular structures while minimizing memory usage. uQlust can handle structures of varying lengths and perform hierarchical clustering of the entire Protein Data Bank on a standard laptop. The tool significantly reduces computational complexity and memory requirements compared to existing methods while maintaining result quality for protein and RNA structure analysis.

Topic

Chemistry;Gene and protein families;RNA;Structure analysis

Detail

  • Operation: Sequence clustering

  • Software interface: Command-line interface

  • Language: C++,C#

  • License: Not stated

  • Cost: Free of charge

  • Version name: 3.0

  • Credit: NIH.

  • Input: -

  • Output: -

  • Contact: Rafal Adamczak raad@is.umk.pl

  • Collection: -

  • Maturity: -

Publications

  • UQlust: combining profile hashing with linear-time ranking for efficient clustering and analysis of big macromolecular data.
  • Adamczak R and Meller J. UQlust: combining profile hashing with linear-time ranking for efficient clustering and analysis of big macromolecular data. UQlust: combining profile hashing with linear-time ranking for efficient clustering and analysis of big macromolecular data. 2016; 17:546. doi: 10.1186/s12859-016-1381-2
  • https://doi.org/10.1186/S12859-016-1381-2
  • PMID: 28031034
  • PMC: PMC5198500

Download and documentation


< Back to DB search