TEMP2
TEMP2 detects transposon insertions from short-read whole-genome sequencing, precisely identifies germline insertions, estimates frequencies of de novo (singleton) insertions, and discriminates chimeric-read artifacts.
Key Features:
- High sensitivity and precision: Demonstrates exceptional sensitivity and precision in detecting germline transposon insertions validated on simulated Drosophila data and experimental data from flies and humans.
- De novo (singleton) insertion detection: Identifies de novo transposon insertions and estimates their frequencies in short-read whole-genome sequencing data.
- Chimeric-read handling: Distinguishes artificial insertion sites introduced by chimeric reads from true de novo insertions, maintaining accuracy at high chimeric-read levels.
- Validation with long-read benchmark: Performance is benchmarked against a PacBio long-read sequencing dataset from Drosophila.
Scientific Applications:
- Understanding genomic instability: Enables investigation of how new transposon insertions contribute to genomic instability.
- Studying evolutionary dynamics: Supports analysis of the role of transposons in host genome evolution.
- Investigating hybrid dysgenesis and transposon regulation: Applied to cases such as hybrid dysgenic flies with de-repressed P-elements to track continuous new insertions prior to re-establishment of piRNA-mediated repression.
Methodology:
Leverages a PacBio long-read sequencing–based benchmark dataset from Drosophila for validation of detection and frequency estimation.
Topics
Details
- Tool Type:
- command-line tool, library
- Programming Languages:
- Shell, Perl, R
- Added:
- 3/19/2021
- Last Updated:
- 11/24/2024
Operations
Publications
Yu T, Huang X, Dou S, Tang X, Luo S, Theurkauf WE, Lu J, Weng Z. A benchmark and an algorithm for detecting germline transposon insertions and measuring <i>de novo</i> transposon insertion frequencies. Nucleic Acids Research. 2021;49(8):e44-e44. doi:10.1093/nar/gkab010. PMID:33511407. PMCID:PMC8096211.
DOI: 10.1093/nar/gkab010
PMID: 33511407
PMCID: PMC8096211
Funding: - Chinese National Natural Science Foundation: 31871296
- National Institutes of Health: HD078253
Downloads
- Downloads pagehttps://publications.wenglab.org/TEMP2/download/