CAEN

CAEN (Category Encoding) for RNA-Seq Data Classification. The software tool uses bulk and single-cell RNA-seq data to detect differentially expressed (DE) genes. The tool employs several statistical methods for the classification of these data. However, many genes are not differentially expressed and are therefore irrelevant for class distinction. Thus, the tool aims to remove these unrelated genes to improve the classification performance and reduce computation time.
The CAEN method proposes a new way of selecting feature genes for classification by encoding categories based on the rank of sequence samples for each gene in each class. The tool considers correlation coefficients between genes and classes with the rank of the sample to determine the most compelling feature genes. The Sure Screening method is also established to ensure consistency in the ranking process.

Topic

RNA-Seq;Gene expression;Microarray experiment

Detail

  • Operation: Sequence classification;Expression correlation analysis;Feature selection

  • Software interface: Library

  • Language: R

  • License: GNU General Public License, version 2

  • Cost: Free

  • Version name: 1.11.0

  • Credit: -

  • Input: -

  • Output: -

  • Contact: Zhou Yan 2160090406@email.szu.edu.cn

  • Collection: -

  • Maturity: Stable

Publications

  • Category encoding method to select feature genes for the classification of bulk and single-cell RNA-seq data.
  • Zhou Y, et al. Category encoding method to select feature genes for the classification of bulk and single-cell RNA-seq data. Category encoding method to select feature genes for the classification of bulk and single-cell RNA-seq data. 2021; 40:4077-4089. doi: 10.1002/sim.9015
  • https://doi.org/10.1002/SIM.9015
  • PMID: 34028849
  • PMC: -

Download and documentation


< Back to DB search