PepArML
PepArML integrates multiple peptide-search engines to increase the sensitivity and accuracy of peptide identification from tandem mass spectra for proteomics analyses.
Key Features:
- Integrated search-engine support: Runs Mascot, X!Tandem (native, k-score, s-score), OMSSA, MyriMatch, and InsPecT (using MS-GF spectral probability scores) and analyzes their results together.
- Dynamic search configuration: Reformats spectral data and constructs engine-specific search configurations on the fly.
- Machine-learning result combiner: Applies an unsupervised, model-free machine-learning combiner to select optimal peptide identifications per spectrum using enzymatic digestion patterns, retention time, precursor isotope clusters, mass accuracy, and proteotypic peptide properties.
- False-discovery rate estimation: Estimates false-discovery rates to provide confidence metrics for identifications.
- Scalable computation: Leverages cluster, grid, and cloud computing resources for large-scale peptide identification tasks.
- Standardized output: Emits peptide identifications in pepXML format.
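The entry does not say how PepArML computes its false-discovery rates. As an illustration of the general idea only, the sketch below uses target-decoy counting, a common approach in proteomics, and should not be read as PepArML's own procedure. The `PSM` class, `estimate_fdr`, `threshold_at_fdr`, and the example scores are all hypothetical names and values chosen for this sketch.

```python
from dataclasses import dataclass


@dataclass
class PSM:
    """A peptide-spectrum match (hypothetical structure for this sketch)."""
    spectrum: str
    peptide: str
    score: float    # assumption: higher scores are better
    is_decoy: bool  # True if the peptide comes from a decoy database


def estimate_fdr(psms, threshold):
    """Estimate FDR among PSMs scoring >= threshold as decoys / targets."""
    accepted = [p for p in psms if p.score >= threshold]
    decoys = sum(1 for p in accepted if p.is_decoy)
    targets = len(accepted) - decoys
    if targets == 0:
        return 0.0
    return min(1.0, decoys / targets)


def threshold_at_fdr(psms, max_fdr):
    """Return the lowest score threshold whose estimated FDR is <= max_fdr.

    Returns None if no threshold satisfies the limit.
    """
    best = None
    for t in sorted({p.score for p in psms}, reverse=True):
        if estimate_fdr(psms, t) <= max_fdr:
            best = t
    return best


# Invented example data, not real search results.
example = [
    PSM("s1", "PEPTIDEA", 10.0, False),
    PSM("s2", "PEPTIDEB", 9.0, False),
    PSM("s3", "PEPTIDEC", 8.0, True),
    PSM("s4", "PEPTIDED", 7.0, False),
    PSM("s5", "PEPTIDEE", 6.0, True),
]
print(threshold_at_fdr(example, 0.10))  # 9.0
print(threshold_at_fdr(example, 0.40))  # 7.0
```

Because the estimated FDR is not monotone in the threshold, the sketch scans every distinct score and keeps the lowest one that satisfies the limit, rather than stopping at the first failure.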
Scientific Applications:
- Increased identification sensitivity: Identifies two- to three-fold more spectra than individual search engines at a 10% false-discovery rate.
- Proteomics mapping: Supports protein function, interaction, and modification mapping in large-scale mass spectrometry proteomics analyses.
Methodology:
PepArML processes spectra in the following stages:
- Reformats the spectra and constructs an engine-specific search configuration for each engine.
- Runs Mascot, X!Tandem (native, k-score, s-score), OMSSA, MyriMatch, and InsPecT (MS-GF spectral probability scores) on cluster, grid, and cloud computing resources.
- Combines the engine outputs with an unsupervised, model-free machine-learning combiner that evaluates enzymatic digestion patterns, retention time, precursor isotope clusters, mass accuracy, and proteotypic peptide properties.
- Estimates false-discovery rates (FDR).
- Outputs the identifications in pepXML format.
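Since the results are emitted as pepXML, they can be read with any XML parser. The following is a minimal sketch using Python's standard library. It assumes only the core pepXML elements `spectrum_query` and `search_hit` with their `spectrum`, `hit_rank`, and `peptide` attributes. The helper names and the sample document are invented for illustration, and real PepArML output will carry many more elements and attributes.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip an XML namespace prefix such as '{uri}search_hit'."""
    return tag.rsplit("}", 1)[-1]


def top_hits(pepxml_text):
    """Return (spectrum, peptide) for the rank-1 hit of each spectrum query."""
    root = ET.fromstring(pepxml_text)
    hits = []
    for query in root.iter():
        if local_name(query.tag) != "spectrum_query":
            continue
        for hit in query.iter():
            if local_name(hit.tag) == "search_hit" and hit.get("hit_rank") == "1":
                hits.append((query.get("spectrum"), hit.get("peptide")))
                break  # keep only the first rank-1 hit per spectrum
    return hits


# Invented sample, heavily trimmed; not actual PepArML output.
SAMPLE = """<msms_pipeline_analysis xmlns="http://regis-web.systemsbiology.net/pepXML">
  <msms_run_summary>
    <spectrum_query spectrum="run1.00001.00001.2" assumed_charge="2">
      <search_result>
        <search_hit hit_rank="1" peptide="ELVISLIVESK" protein="P1"/>
        <search_hit hit_rank="2" peptide="PEPTIDEK" protein="P2"/>
      </search_result>
    </spectrum_query>
  </msms_run_summary>
</msms_pipeline_analysis>"""

print(top_hits(SAMPLE))  # [('run1.00001.00001.2', 'ELVISLIVESK')]
```

Matching on local element names keeps the sketch independent of the exact namespace URI declared in a given file.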
Details
- Tool Type: web application
- Added: 3/5/2018
- Last Updated: 11/25/2024
Operations
- Peptide identification
Publications
Edwards NJ. PepArML: A Meta‐Search Peptide Identification Platform for Tandem Mass Spectra. Current Protocols in Bioinformatics. 2013;44(1). doi:10.1002/0471250953.bi1323s44. PMID:25663956. PMCID:PMC4317344.