pfsearch

pfsearch searches protein sequences against PROSITE generalized profiles to detect domain signatures for functional annotation of genomes and metagenomes.


Key Features:

  • Optimized Performance: Uses modern CPU instructions and multicore parallelism; on an x86_64 hyper‑threaded quad‑core system, searches run up to two orders of magnitude faster than with the original pfsearch algorithm.
  • Heuristic Filtering: Computes a fast heuristic score for each candidate sequence and discards candidates below a heuristic cut-off score, reducing the search space.
  • Implementation: Written in C for Linux and optimized for x86_64 multicore processors.
  • PROSITE Profile Support: Uses PROSITE generalized profiles to detect domain signatures for precise functional annotation.
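
The heuristic filtering idea above can be sketched in a few lines of Python. This is an illustration only, not the pfsearch heuristic itself: the profile representation (one residue-to-score table per position), the function names heuristic_score and prefilter, and the default score of 0 for residues missing from a column are all assumptions made for the example. The sketch computes a cheap alignment-free score per sequence and keeps only sequences that reach a cut-off.

```python
from typing import Dict, List

# Hypothetical profile layout: one residue -> score table per profile position.
Profile = List[Dict[str, int]]


def heuristic_score(profile: Profile, sequence: str) -> int:
    """Sum, over profile positions, the best score reached by any residue
    present in the sequence. No alignment is computed, so this is cheap."""
    residues = set(sequence)
    total = 0
    for column in profile:
        # Residues absent from the column score 0 (an assumption of this sketch).
        total += max((column.get(r, 0) for r in residues), default=0)
    return total


def prefilter(profile: Profile, sequences: Dict[str, str], cutoff: int) -> List[str]:
    """Return the names of sequences whose heuristic score reaches the cut-off."""
    return [
        name
        for name, seq in sequences.items()
        if heuristic_score(profile, seq) >= cutoff
    ]


toy_profile = [{"A": 5, "C": -2}, {"G": 4}, {"A": 1, "T": 3}]
toy_sequences = {"seq1": "AGT", "seq2": "CCC"}
survivors = prefilter(toy_profile, toy_sequences, cutoff=10)
```

Only the sequences in survivors would then be passed to the more expensive profile alignment step.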

Scientific Applications:

  • Genome Annotation: Accelerates large-scale genome annotation pipelines by enabling rapid detection of PROSITE-profiled protein domains.
  • Metagenomic Analysis: Enables high-throughput metagenome annotation through fast domain detection across large sequence datasets.
  • Functional Annotation and Database Searches: Facilitates extensive database searches for protein domain signatures and functional annotation in proteomes and sequence datasets.

Methodology:

Performs profile-based searches of protein sequences against PROSITE generalized profiles. The optimized C implementation exploits modern CPU instructions and multicore processing. A heuristic filter scores each candidate and applies heuristic cut-off scores to select the matches to report.
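
As a rough illustration of what a profile-based search scores, the following Python sketch computes a local alignment score between a toy profile and a sequence, then reports sequences that reach a cut-off. It is a simplification under stated assumptions, not pfsearch's algorithm: real generalized profiles carry position-specific insertion and deletion scores, whereas this sketch uses a single linear gap penalty, and the names local_profile_score and search, the default mismatch score, and the toy data are hypothetical.

```python
from typing import Dict, List

# Hypothetical profile layout: one residue -> score table per profile position.
Profile = List[Dict[str, int]]


def local_profile_score(
    profile: Profile, sequence: str, gap: int = 2, default: int = -1
) -> int:
    """Best local alignment score of a sequence against a profile,
    using dynamic programming with a fixed linear gap penalty."""
    prev = [0] * (len(sequence) + 1)
    best = 0
    for column in profile:
        curr = [0] * (len(sequence) + 1)
        for j, residue in enumerate(sequence, start=1):
            match = prev[j - 1] + column.get(residue, default)
            skip_profile_position = prev[j] - gap  # profile position left unmatched
            skip_residue = curr[j - 1] - gap       # sequence residue left unmatched
            curr[j] = max(0, match, skip_profile_position, skip_residue)
            best = max(best, curr[j])
        prev = curr
    return best


def search(profile: Profile, sequences: Dict[str, str], cutoff: int) -> Dict[str, int]:
    """Return name -> score for every sequence scoring at or above the cut-off."""
    hits = {}
    for name, seq in sequences.items():
        score = local_profile_score(profile, seq)
        if score >= cutoff:
            hits[name] = score
    return hits


toy_profile = [{"A": 5}, {"G": 4}, {"T": 3}]
toy_sequences = {"seq1": "AGT", "seq2": "CCC", "seq3": "AT"}
hits = search(toy_profile, toy_sequences, cutoff=10)
```

Because each sequence is scored independently, a search like this can be split across cores by giving each worker its own share of the sequence database.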

Details

Tool Type: command-line tool
Operating Systems: Linux
Programming Languages: Fortran, C
Added: 8/3/2017
Last Updated: 11/25/2024

Publications

Schuepbach T, Pagni M, Bridge A, Bougueleret L, Xenarios I, Cerutti L. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics. 2013;29(9):1215-1217. doi:10.1093/bioinformatics/btt129. PMID:23505298. PMCID:PMC3634184.
