PDEGEM
PDEGEM estimates transcript and isoform abundance from RNA-Seq data using a nonlinear, positional-dependent, energy-guided regression model that accounts for non-uniform read distribution and stacking-energy–related interactions between DNA templates and reads.
Key Features:
- Nonlinear Regression Model: Uses a nonlinear regression framework to estimate transcript abundance from RNA-Seq read data.
- Positional Dependency: Incorporates positional dependency and positional weight to model non-uniform read distribution along transcripts.
- Energy-Guided Expression: Integrates stacking energy terms to represent interaction dynamics between DNA templates and newly synthesized reads.
Scientific Applications:
- RNA-Seq Quantification: Precise estimation of transcript and isoform expression levels from RNA-Seq datasets.
- Gene and Isoform Expression Comparison: Derivation of reliable expression measures with demonstrated higher correlation to alternative assays.
- Method Benchmarking: Comparative evaluation of expression-estimation methods, explicitly compared against mseq.
- Cross-platform and Cross-species Trend Analysis: Analysis of common trends across sequencing platforms and species to investigate DNA binding–related mechanisms.
Methodology:
Adapts the Positional Dependent Nearest Neighborhood (PDNN) framework to RNA-Seq by implementing a nonlinear regression model that incorporates positional dependencies, positional weights and stacking-energy terms to fit read distributions and improve correlation with alternative assays.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Windows
- Programming Languages:
- R
- Added:
- 12/18/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Xia Y, Wang F, Qian M, Qin Z, Deng M. PDEGEM: Modeling non-uniform read distribution in RNA-Seq data. BMC Medical Genomics. 2015;8(S2). doi:10.1186/1755-8794-8-s2-s14. PMID:26044773. PMCID:PMC4460722.