zFPKM

zFPKM normalizes gene-level FPKM and TPM values into a zFPKM metric to separate biologically active genes from ultralow-expression background and to distinguish biologically relevant transcripts from technical or biological noise.


Key Features:

  • Normalization Metric: Introduces a zFPKM normalization metric that differentiates active genes from ultralow-expression genes associated with repressed chromatin states.
  • Threshold Identification: Defines a robust expression threshold (zFPKM > -3) for selecting expressed genes.
  • Validation with Chromatin States: Uses orthogonal chromatin-state information, such as data from the ENCODE project, to validate the expression threshold.
  • Robustness to Variability: Demonstrates resilience to experimental and analytical variation across different RNA-seq datasets.
  • Gene-Level Specificity: Calibrated for gene-level quantitation using FPKM or TPM and reported to be less suitable for transcript-level calibration.

Scientific Applications:

  • Transcriptome Analysis: Separates active genes from background to improve accuracy of gene-level transcriptome characterization.
  • RNA-seq Study Design: Informs study design by recommending an approximate read depth (20–30 million mapped reads) for high-confidence quantitation at the identified threshold.
  • Validation and Calibration: Provides a framework for validating RNA-seq expression estimates using chromatin-state data from resources such as ENCODE.

Methodology:

Compute a zFPKM normalization metric from gene-level FPKM or TPM values, apply the threshold zFPKM > -3 to define expressed genes, and validate this threshold using chromatin-state data from ENCODE.

Topics

Collections

Details

License:
GPL-3.0
Cost:
Free of charge
Tool Type:
library
Operating Systems:
Linux, Windows, Mac
Programming Languages:
R
Added:
7/27/2018
Last Updated:
1/10/2019

Operations

Publications

Hart T, Komori H, LaMere S, Podshivalova K, Salomon DR. Finding the active genes in deep RNA-seq gene expression studies. BMC Genomics. 2013;14(1):778. doi:10.1186/1471-2164-14-778. PMID:24215113. PMCID:PMC3870982.

Documentation

Downloads

Links