MzJava

MzJava provides a Java API for processing and analyzing mass spectrometry (MS) data for large-scale proteomics, glycomics, and metabolomics studies.


Key Features:

  • Data Structures and Algorithms: Represent and process mass spectra and associated biological entities such as metabolites, glycans, and peptides.
  • Mass Calculation and Peak Processing: Accurate mass calculation, plus peak centroiding, filtering, and transformation.
  • Spectrum Alignment and Clustering: Compare and group spectra within large datasets.
  • Protein Digestion and Fragmentation: Protein digestion and peptide/glycan fragmentation routines that generate theoretical fragments.
  • Scoring Functions: Score spectrum-spectrum matches and peptide/glycan-spectrum matches to support identification.
  • Data Import/Export: Readers and writers for commonly used MS file formats.
  • Cluster Computing Support: Hadoop MapReduce and Apache Spark integration for cluster-scale processing.
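
To make the digestion and mass calculation features above concrete, here is a minimal sketch in plain Java. It does not use the MzJava API: the class name `PeptideMassSketch` and its methods are hypothetical, and the digestion rule is a simplified trypsin model (cleave after K or R unless the next residue is P, no missed cleavages). The residue masses are standard monoisotopic values.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch in plain Java; none of these names come from the MzJava API.
class PeptideMassSketch {

    static final double WATER = 18.010565;  // monoisotopic mass of H2O (Da)
    static final double PROTON = 1.007276;  // mass of a proton (Da)

    // Monoisotopic residue masses (Da), indexed by one-letter amino acid code.
    static final double[] RESIDUE_MASS = new double[128];
    static {
        String codes = "GASPVTCLINDQKEMHFRYW";
        double[] masses = {
            57.02146, 71.03711, 87.03203, 97.05276, 99.06841,
            101.04768, 103.00919, 113.08406, 113.08406, 114.04293,
            115.02694, 128.05858, 128.09496, 129.04259, 131.04049,
            137.05891, 147.06841, 156.10111, 163.06333, 186.07931
        };
        for (int i = 0; i < codes.length(); i++) {
            RESIDUE_MASS[codes.charAt(i)] = masses[i];
        }
    }

    // Simplified trypsin rule: cleave after K or R unless followed by P.
    static List<String> digestTrypsin(String protein) {
        List<String> peptides = new ArrayList<>();
        int start = 0;
        for (int i = 0; i < protein.length(); i++) {
            char c = protein.charAt(i);
            boolean last = i == protein.length() - 1;
            boolean cleave = (c == 'K' || c == 'R')
                    && !last && protein.charAt(i + 1) != 'P';
            if (cleave || last) {
                peptides.add(protein.substring(start, i + 1));
                start = i + 1;
            }
        }
        return peptides;
    }

    // Neutral monoisotopic mass: sum of residue masses plus one water.
    static double monoisotopicMass(String peptide) {
        double mass = WATER;
        for (char c : peptide.toCharArray()) {
            if (c >= RESIDUE_MASS.length || RESIDUE_MASS[c] == 0.0) {
                throw new IllegalArgumentException("Unknown residue: " + c);
            }
            mass += RESIDUE_MASS[c];
        }
        return mass;
    }

    // m/z of the protonated peptide at the given positive charge state.
    static double mz(double neutralMass, int charge) {
        return (neutralMass + charge * PROTON) / charge;
    }
}
```

Calling `digestTrypsin` on a protein sequence and passing each resulting peptide to `monoisotopicMass` and `mz` gives the theoretical precursor values that a search would compare against observed spectra. A real library would also handle modifications, missed cleavages, and other proteases.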

Scientific Applications:

  • Proteomics: High-throughput identification and analysis of peptides and proteins from MS data.
  • Glycomics: Identification and characterization of glycans and glycopeptides from MS data.
  • Metabolomics: Analysis and identification of metabolites from MS datasets.

Methodology:

MzJava implements data structures and algorithms for representing mass spectra; calculating masses; centroiding, filtering, and transforming peaks; aligning and clustering spectra; digesting proteins and fragmenting peptides and glycans; and scoring spectrum-spectrum and peptide/glycan-spectrum matches. It also provides readers and writers for common MS file formats and supports Hadoop MapReduce and Apache Spark.
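
As an illustration of what a spectrum-spectrum score can look like, the sketch below bins two peak lists and computes their cosine similarity (normalized dot product). This is a generic technique written in plain Java, not MzJava's own scoring code; the class `SpectrumSimilaritySketch`, its methods, and the `binWidth` parameter are assumptions made for the example.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative sketch in plain Java; none of these names come from the MzJava API.
class SpectrumSimilaritySketch {

    // Sum peak intensities into fixed-width m/z bins keyed by bin index.
    static Map<Long, Double> bin(double[] mz, double[] intensity, double binWidth) {
        if (mz.length != intensity.length) {
            throw new IllegalArgumentException("mz and intensity lengths differ");
        }
        if (binWidth <= 0) {
            throw new IllegalArgumentException("binWidth must be positive");
        }
        Map<Long, Double> bins = new HashMap<>();
        for (int i = 0; i < mz.length; i++) {
            long key = (long) Math.floor(mz[i] / binWidth);
            bins.merge(key, intensity[i], Double::sum);
        }
        return bins;
    }

    // Cosine similarity of two binned spectra: 1.0 for identical shape,
    // 0.0 when no bins are shared or either spectrum is empty.
    static double cosine(Map<Long, Double> a, Map<Long, Double> b) {
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (Map.Entry<Long, Double> e : a.entrySet()) {
            double v = e.getValue();
            normA += v * v;
            Double other = b.get(e.getKey());
            if (other != null) {
                dot += v * other;
            }
        }
        for (double v : b.values()) {
            normB += v * v;
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / Math.sqrt(normA * normB);
    }
}
```

Because the score is normalized, two spectra with the same peaks at different absolute intensities still score 1.0. Clustering can then group spectra whose pairwise score exceeds a chosen threshold.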

Details

Tool Type: library
Operating Systems: Linux, Windows, Mac
Programming Languages: Java
Added: 1/17/2017
Last Updated: 11/25/2024

Publications

Horlacher O, Nikitin F, Alocci D, Mariethoz J, Müller M, Lisacek F. MzJava: An open source library for mass spectrometry data processing. Journal of Proteomics. 2015;129:63-70. doi:10.1016/j.jprot.2015.06.013. PMID:26141507.

Funding: Swiss National Science Foundation (31003A 141215, CRSII3 136282, SNSF 315230 130830); EU (316929)

Links

Software catalogue: http://ms-utils.org