Pyteomics
Pyteomics provides Python modules for programmatic processing and analysis of mass spectrometry (MS)-based proteomics data, enabling parsing of LC-MS/MS files, interpretation of search-engine outputs, interaction with protein sequence databases, computation of masses and m/z, and prediction of peptide retention times and electrochemical properties.
Key Features:
- LC-MS/MS data parsing: Reads and exposes spectra and chromatograms from LC-MS/MS files.
- Search-engine output interpretation: Parses and processes search-engine output to extract proteomic search results.
- Protein sequence database handling: Provides tools to manage and query protein sequence databases.
- Theoretical predictions: Computes theoretical retention times and electrochemical properties of polypeptides.
- Mass and m/z calculations: Calculates accurate masses and mass-to-charge (m/z) values for peptides and proteins.
- Sequence parsing and analysis: Parses and manipulates protein and peptide sequences, supporting modification analysis and sequence-alignment tasks.
- File format support: Supports proteomics file formats including mzML, MGF, and pepXML.
Scientific Applications:
- Peptide identification and annotation: Facilitates peptide identification workflows by enabling sequence database queries and mass calculations.
- Protein identification and quantification: Supports identification and quantification from LC-MS/MS data through spectrum parsing and result interpretation.
- MS/MS spectrum interpretation: Enables downstream analysis of search-engine outputs and spectral data for proteomic inference.
- Retention time and property prediction: Aids experimental design and data interpretation via retention time and electrochemical property predictions.
- Mass-accuracy based identification: Supports m/z calculations to assist precise peptide and protein identification in mass spectrometry data.
Methodology:
Parsing LC-MS/MS files (e.g., mzML, MGF, pepXML), interpreting search-engine outputs, querying protein sequence databases, predicting retention times and electrochemical properties, calculating masses and m/z, and parsing sequences for modification analysis.
Topics
Collections
Details
- License:
- Apache-2.0
- Tool Type:
- library
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- Python
- Added:
- 1/17/2017
- Last Updated:
- 7/4/2025
Operations
Publications
Goloborodko AA, Levitsky LI, Ivanov MV, Gorshkov MV. Pyteomics—a Python Framework for Exploratory Data Analysis and Rapid Software Prototyping in Proteomics. Journal of the American Society for Mass Spectrometry. 2013;24(2):301-304. doi:10.1007/s13361-012-0516-6. PMID:23292976.
Levitsky LI, Klein JA, Ivanov MV, Gorshkov MV. Pyteomics 4.0: Five Years of Development of a Python Proteomics Framework. Journal of Proteome Research. 2018;18(2):709-714. doi:10.1021/acs.jproteome.8b00717.
Documentation
Downloads
- Software packagehttps://pypi.org/project/pyteomics/