LAMPA

"LAMPA" (LArge Multidomain Protein Annotator) is a computational tool to enhance the accuracy of estimating statistical significance in profile-profile searches of sequence similarity, particularly for multidomain proteins. Recognizing the challenges posed by using whole proteins as queries—due to the difficulty in establishing homology for similarities near cutoff levels and the complexity introduced by multidomain structures—LAMPA introduces an iterative approach to expand hit coverage gradually.

Key Features and Functionalities:

- Iterative Expansion of Hit Coverage: LAMPA systematically expands the hit coverage of multidomain proteins by re-evaluating the statistical significance of hit similarity using progressively more minor queries defined at each iteration. This approach allows for more precise domain delineation and homology detection.

- Integration with TMHMM and HHsearch: To facilitate its analysis, LAMPA employs TMHMM for the recognition of transmembrane regions and HHsearch for homology detection, combining these tools to significantly improve the annotation of protein domains.

- Pfam Database Annotation: Utilizing the Pfam database, LAMPA provided annotations that outperformed RefSeq expert annotation regarding the number of regions and annotated length for a significant portion of RNA virus polyprotein entries.

- Rationalization of Results: The improvement offered by LAMPA over traditional methods was rationalized based on the dependencies of HHsearch hit statistical significance on the lengths and diversities of query-target pairs, highlighting the importance of considering these factors in sequence similarity searches.

Topic

Protein folds and structural domains;RNA;Proteomics;Sequence analysis;Transcription factors and regulatory sites

Detail

  • Operation: Database search;Transmembrane protein prediction;Fold recognition

  • Software interface: Library

  • Language: R

  • License: GNU General Public License >= version 2

  • Cost: Free with restrictions

  • Version name: 1.0.0

  • Credit: Yhe EU Horizon2020 EVAg 653316 project, the LUMC MoBiLe program, Leiden University Fund (LUF).

  • Input: -

  • Output: -

  • Contact: Alexander E Gorbalenya a.e.gorbalenya@lumc.nl

  • Collection: -

  • Maturity: -

Publications

  • LAMPA, LArge Multidomain Protein Annotator, and its application to RNA virus polyproteins.
  • Gulyaeva AA, et al. LAMPA, LArge Multidomain Protein Annotator, and its application to RNA virus polyproteins. LAMPA, LArge Multidomain Protein Annotator, and its application to RNA virus polyproteins. 2020; 36:2731-2739. doi: 10.1093/bioinformatics/btaa065
  • https://doi.org/10.1093/BIOINFORMATICS/BTAA065
  • PMID: 32003788
  • PMC: PMC7203729

Download and documentation


< Back to DB search