rstoolbox
The rstoolbox is a Python library that supports the analysis and management of large-scale protein structure and sequence datasets, mainly focusing on the needs of computational protein design (CPD) applications. As experimental and computational methods in biological research rapidly generate vast amounts of data, there's a crucial demand for tools that can efficiently process and analyze these datasets to advance our understanding of protein folding, stability, and function.
CPD is a structure-based approach that leverages computational methods to engineer proteins for novel functions, requiring the generation and analysis of large numbers of structural models to identify optimal structure-sequence configurations. A pivotal step in CPD workflows is the selection of a promising subset of sequences for experimental characterization, a task complicated by the limitations of current CPD scoring functions, necessitating multi-step design protocols and sophisticated analysis of decoy populations to enhance the chances of experimental success.
The rstoolbox meets this need by offering functionalities tailored to CPD software users and developers. For users, it provides capabilities to profile and select decoy sets, assisting in the refinement of design protocols and identifying sequences for further experimental investigation. It features intuitive solutions for visualizing extensive sequence and structure datasets, such as logo plots and heatmaps. It supports analyzing experimental data derived from biochemical techniques and high-throughput sequencing.
For developers, the rstoolbox serves as a valuable framework for benchmarking and comparing different CPD strategies, facilitating the development and refinement of CPD software. Demonstrated applications of the rstoolbox in both user and developer contexts highlight its versatility and effectiveness in CPD-related tasks.
Significantly, the rstoolbox is designed for interactive use, offering seamless integration with IPython environments and capable of operating in high-performance computing settings. This dual capability ensures that the rstoolbox is not only accessible for exploratory analysis and visualization but also robust enough to handle the computational demands of large-scale data analysis.
Topic
Protein folding, stability and design;Protein folds and structural domains
Detail
Operation: Ab initio structure prediction;Molecular docking;Scaffolding
Software interface: Library
Language: Python
License: The MIT License
Cost: Free with restrictions
Version name: -
Credit: EPFL-Fellows grant, the Swiss Systemsx.ch initiative, the ERC, the SNSF and the Biltema Foundation.
Input: -
Output: -
Contact: Jaume Bonet jaume.bonet@gmail.com
Collection: -
Maturity: Stable
Publications
- rstoolbox - a Python library for large-scale analysis of computational protein design data and structural bioinformatics.
- Bonet J, et al. rstoolbox - a Python library for large-scale analysis of computational protein design data and structural bioinformatics. rstoolbox - a Python library for large-scale analysis of computational protein design data and structural bioinformatics. 2019; 20:240. doi: 10.1186/s12859-019-2796-3
- https://doi.org/10.1186/s12859-019-2796-3
- PMID: 31092198
- PMC: PMC6521408
Download and documentation
Source: https://github.com/jaumebonet/RosettaSilentToolbox/releases/tag/v1.0.0
Documentation: https://github.com/jaumebonet/RosettaSilentToolbox/blob/master/README.rst
Home page: https://github.com/jaumebonet/RosettaSilentToolbox
< Back to DB search