CITRUS

CITRUS infers patient-specific transcription factor activities and transcriptional regulatory programs from somatic alterations and chromatin-informed multi-omic data in cancer using a partially interpretable neural network with a self-attention mechanism.


Key Features:

  • Partially Interpretable Neural Network Model: Integrates genomic, transcriptomic, and epigenomic data to model how somatic alterations affect transcriptional regulation.
  • Self-Attention Mechanism: Employs a self-attention module to capture the contextual impact of somatic alterations on gene expression.
  • Explicit Transcription Factor Representation: Includes hidden nodes that explicitly represent TF states and enable prediction of TF activities.
  • TF–Target Linking via Binding Motifs in Open Chromatin: Learns connections between TFs and target genes based on binding motifs located in open chromatin regions.
  • Patient-Specific TF Activity Prediction: Predicts TF activity profiles at the individual tumor level, enabling identification of TFs associated with somatic alterations.
  • Multi-cancer Application on TCGA: Applied to data from 17 cancer types profiled by The Cancer Genome Atlas (TCGA) to analyze diverse tumor profiles.
  • Multi-omic Integration to Model Somatic Alteration Influence: Integrates multiple omic layers to infer the influence of somatic alterations on downstream gene expression.

Scientific Applications:

  • Precision oncology: Predicts patient-specific TF activities to identify regulatory drivers linked to somatic alterations for individualized interpretation.
  • Transcriptional program characterization: Characterizes variation in transcriptional programs between and within tumor types.
  • Mechanistic interpretation of gene dysregulation: Links somatic alterations to downstream TF activity and gene expression changes to explain tumor-specific dysregulation.
  • Prioritization for therapeutic strategies: Identifies candidate TFs and regulatory programs that can inform personalized therapeutic hypotheses.

Methodology:

Uses a partially interpretable neural network with hidden nodes representing TFs and a self-attention mechanism, integrating genomic, transcriptomic, and epigenomic (open chromatin) data and using TF binding motifs in open chromatin regions to learn TF–target gene connections and infer TF activity from somatic alterations.

Topics

Details

License:
MIT
Cost:
Free of charge
Tool Type:
command-line tool
Operating Systems:
Linux, Mac
Programming Languages:
Python
Added:
1/8/2023
Last Updated:
11/24/2024

Operations

Data Inputs & Outputs

Essential dynamics

Outputs

    Publications

    Tao Y, Ma X, Palmer D, Schwartz R, Lu X, Osmanbeyoglu HU. Interpretable deep learning for chromatin-informed inference of transcriptional programs driven by somatic alterations across cancers. Nucleic Acids Research. 2022;50(19):10869-10881. doi:10.1093/nar/gkac881. PMID:36243974. PMCID:PMC9638905.

    PMID: 36243974
    PMCID: PMC9638905
    Funding: - National Institutes of Health: R00 CA207871, R01 LM012011, R01HG010589, R21CA216452 - Pennsylvania Department of Health: FP00003273