RDDpred

RDDpred predicts condition-specific RNA-editing events from RNA-seq data using a Random Forest classifier to distinguish true edits from sequencing and mapping artefacts.


Key Features:

  • RNA-seq input: Operates on RNA-seq–derived candidate sites to detect RNA-editing events.
  • Random Forest classifier: Uses a Random Forest–based classifier to separate true RNA-editing events from false positives and artefacts.
  • Automatic training example compilation: Compiles condition-specific training examples automatically without requiring experimental validation.
  • Validation on public datasets: Reproduced reported RNA-editing sites with accuracies of 90% and 95% on two publicly available RNA-editing datasets.
  • Negative predictive value (NPV): Achieved NPVs of 75% and 84% on the two validation datasets.
  • Biochemical specificity: Accounts for canonical RNA-editing biochemistry including adenosine (A) to inosine (I) edits mediated by ADAR and cytidine (C) to uridine (U) edits mediated by APOBEC.
  • False discovery reduction: Aims to reduce false discoveries inherent to RNA-seq–based detection of RNA-editing.
  • Reference database statistic: References the public RDD database DARNED reporting over 300,000 editing sites in the human genome (hg19).
  • Condition specificity statistic: Incorporates the observation that 97.62% of registered editing sites are detected in single tissues or specific conditions.
  • Biological impact context: Enables investigation of RNA-editing impacts on protein activity modulation, alternative splicing of mRNA, and miRNA target site substitutions.

Scientific Applications:

  • Condition-specific RNA-editing detection: Identify RNA-editing events that are specific to tissues or experimental conditions from RNA-seq data.
  • Transcriptome-level studies: Support investigations of RNA-editing contributions to protein function, alternative splicing, and miRNA targeting.
  • False-positive filtering: Reduce artefactual calls in RNA-editing discovery workflows derived from RNA-seq.
  • Validation and reproducibility assessment: Reproduce and assess reported RNA-editing sites from public RNA-editing datasets.

Methodology:

Uses a Random Forest classifier trained on automatically compiled condition-specific examples from RNA-seq candidate sites to classify true RNA-editing events versus artefacts, with validation reported on two public datasets (accuracies 90% and 95%; NPVs 75% and 84%).

Topics

Details

Tool Type:
library
Operating Systems:
Linux
Added:
8/3/2017
Last Updated:
11/25/2024

Operations

Publications

Kim M, Hur B, Kim S. RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data. BMC Genomics. 2016;17(S1). doi:10.1186/s12864-015-2301-y. PMID:26817607. PMCID:PMC4895604.

Documentation

Links