cancerclass
cancerclass implements methods to develop and validate diagnostic, prognostic, and predictive classifiers for cancer using high-dimensional molecular data.
Key Features:
- Feature Selection: Identifies relevant biomarkers from high-dimensional molecular data prior to classifier construction.
- Nearest-Centroid Classification: Performs sample classification using nearest-centroid classifiers.
- Validation Methods: Supports Training and Test set validation, Leave-One-Out Cross-Validation, and multiple random validation protocols including an extended multiple random validation protocol.
- Continuous Prediction Scores: Calculates and visualizes continuous prediction scores to enable thresholding and clinical interpretation.
- Score-to-Probability Conversion: Converts prediction scores into probabilities to distinguish clear-cut versus borderline classification results.
- Clinical Thresholding: Provides mechanisms to balance sensitivity and specificity and to define cutoff values for clinical requirements.
- R Package Implementation: Implements the methods as an R package integrating classifier construction and performance assessment functions.
Scientific Applications:
- Diagnostic and Prognostic Test Development: Development and validation of diagnostic, prognostic, and predictive cancer tests from high-dimensional molecular data.
- Cancer Microarray Analysis: Application to analysis of cancer microarray datasets for classifier demonstration and validation.
- Clinical Decision Support: Use of continuous scores and probability conversion to inform clinical decision-making by distinguishing borderline cases.
Methodology:
Pipeline steps explicitly include feature selection, nearest-centroid classification, computation and visualization of continuous prediction scores, conversion of scores to probabilities, and validation via Training/Test splits, Leave-One-Out Cross-Validation, and multiple random validation protocols (including an extended multiple random validation).
Topics
Collections
Details
- License:
- GPL-3.0
- Tool Type:
- command-line tool, library
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- R
- Added:
- 1/17/2017
- Last Updated:
- 12/10/2018
Operations
Data Inputs & Outputs
Format validation
Inputs
Outputs
Publications
Budczies J, Kosztyla D, Törne Cv, Stenzinger A, Drab-Esfahani S, Dietel M, Denkert C. <b>cancerclass</b>: An<i>R</i>Package for Development and Validation of Diagnostic Tests from High-Dimensional Molecular Data. Journal of Statistical Software. 2014;59(1). doi:10.18637/jss.v059.i01.