spark-cpvs

The software tool 'spark-cpvs' proposes a strategy for structure-based virtual screening of ligands against target proteins, which is based on iteratively docking a set of ligands, training a ligand-based model on this set, and predicting the remainder of the ligands to exclude those predicted as 'low-scoring' ligands. The tool uses SVM and conformal prediction to deliver valid prediction intervals for ranking the predicted ligands and Apache Spark to parallelize both the docking and the modeling. The results show that conformal prediction based virtual screening (CPVS) is able to reduce the number of docked molecules by 62.61%, retain an accuracy for the top 30 hits of 94% on average and have a speedup of 3.7.

Topic

Cheminformatics;Molecular modelling;Compound libraries and screening

Detail

  • Operation: Molecular docking

  • Software interface: Command-line user interface

  • Language: C++

  • License: Apache License 2.0

  • Cost: Free

  • Version name: -

  • Credit: Swedish e-Science Research Center (SeRC) and the strategic research programme eSSENCE.

  • Input: -

  • Output: -

  • Contact: laeeq@kth.se

  • Collection: -

  • Maturity: -

Publications

  • Efficient iterative virtual screening with Apache Spark and conformal prediction.
  • Ahmed L, et al. Efficient iterative virtual screening with Apache Spark and conformal prediction. Efficient iterative virtual screening with Apache Spark and conformal prediction. 2018; 10:8. doi: 10.1186/s13321-018-0265-z
  • https://doi.org/10.1186/s13321-018-0265-z
  • PMID: 29492726
  • PMC: PMC5833896

Download and documentation


< Back to DB search