pfsearch
pfsearch performs protein domain detection by searching sequences with PROSITE generalized profiles to identify signatures for functional annotation in genomes and metagenomes.
Key Features:
- Optimized Performance: Leverages modern CPU instructions and multicore parallelism to achieve up to two orders of magnitude faster searches on x86_64 hyper‑threaded quad‑core systems.
- Heuristic Filtering: Integrates a heuristic filter that rapidly scores candidate matches and applies heuristic cut-off scores to reduce the search space.
- Implementation: Implemented in C and compatible with Linux, with optimizations for x86_64 multicore processors using modern CPU instructions.
- PROSITE Profile Support: Uses PROSITE generalized profiles to detect domain signatures for precise functional annotation.
Scientific Applications:
- Genome Annotation: Accelerates large-scale genome annotation pipelines by enabling rapid detection of PROSITE-profiled protein domains.
- Metagenomic Analysis: Enables high-throughput metagenome annotation through fast domain detection across large sequence datasets.
- Functional Annotation and Database Searches: Facilitates extensive database searches for protein domain signatures and functional annotation in proteomes and sequence datasets.
Methodology:
Performs profile-based searches against PROSITE generalized profiles using an optimized C implementation that exploits modern CPU instructions and multicore processing and applies a heuristic filter that scores candidates and uses heuristic cut-off scores to select matches.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux
- Programming Languages:
- Fortran, C
- Added:
- 8/3/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Schuepbach T, Pagni M, Bridge A, Bougueleret L, Xenarios I, Cerutti L. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics. 2013;29(9):1215-1217. doi:10.1093/bioinformatics/btt129. PMID:23505298. PMCID:PMC3634184.