TEMP2

TEMP2 detects transposon insertions from short-read whole-genome sequencing data. It precisely identifies germline insertions, estimates the frequencies of de novo (singleton) insertions, and distinguishes true insertions from chimeric-read artifacts.


Key Features:

  • High sensitivity and precision: Detects germline transposon insertions with high sensitivity and precision, as validated on simulated Drosophila data and on experimental data from flies and humans.
  • De novo (singleton) insertion detection: Identifies de novo transposon insertions and estimates their frequencies in short-read whole-genome sequencing data.
  • Chimeric-read handling: Distinguishes artificial insertion sites introduced by chimeric reads from true de novo insertions, maintaining accuracy at high chimeric-read levels.
  • Validation with long-read benchmark: Performance is benchmarked against a PacBio long-read sequencing dataset from Drosophila.

Scientific Applications:

  • Understanding genomic instability: Enables investigation of how new transposon insertions contribute to genomic instability.
  • Studying evolutionary dynamics: Supports analysis of the role of transposons in host genome evolution.
  • Investigating hybrid dysgenesis and transposon regulation: Applied to cases such as hybrid dysgenic flies with de-repressed P-elements to track continuous new insertions prior to re-establishment of piRNA-mediated repression.

Methodology:

Insertion detection and frequency estimation are validated against a benchmark dataset built from PacBio long-read sequencing of Drosophila.
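
To make the idea of benchmark-based validation concrete, the following is a minimal Python sketch of how sensitivity and precision could be computed by comparing a set of called insertions against a benchmark set. It is not TEMP2's code and does not reflect its file formats or matching rules. The Insertion record, the sensitivity_precision function, the 50 bp matching window, and the example coordinates are all hypothetical and chosen only for illustration.

```python
from bisect import bisect_left
from collections import defaultdict
from typing import Iterable, NamedTuple


class Insertion(NamedTuple):
    """Hypothetical record for one transposon insertion call."""
    chrom: str
    pos: int
    family: str


def _index(insertions):
    """Group positions by (chromosome, transposon family) and sort them."""
    index = defaultdict(list)
    for ins in insertions:
        index[(ins.chrom, ins.family)].append(ins.pos)
    for positions in index.values():
        positions.sort()
    return index


def _has_match(ins, index, window):
    """True if the index holds a same-family site within `window` bp of `ins`."""
    positions = index.get((ins.chrom, ins.family), [])
    i = bisect_left(positions, ins.pos - window)
    return i < len(positions) and positions[i] <= ins.pos + window


def sensitivity_precision(calls: Iterable[Insertion],
                          benchmark: Iterable[Insertion],
                          window: int = 50):
    """Return (sensitivity, precision) of `calls` relative to `benchmark`.

    The matching window is an assumption made for this sketch, not a
    parameter taken from TEMP2.
    """
    calls, benchmark = list(calls), list(benchmark)
    call_index, bench_index = _index(calls), _index(benchmark)
    # Sensitivity: fraction of benchmark insertions recovered by the calls.
    recovered = sum(_has_match(b, call_index, window) for b in benchmark)
    # Precision: fraction of calls confirmed by a benchmark insertion.
    confirmed = sum(_has_match(c, bench_index, window) for c in calls)
    sensitivity = recovered / len(benchmark) if benchmark else float("nan")
    precision = confirmed / len(calls) if calls else float("nan")
    return sensitivity, precision


# Made-up example data; coordinates and families are illustrative only.
example_benchmark = [
    Insertion("chr2L", 1000, "P-element"),
    Insertion("chr2L", 5000, "roo"),
    Insertion("chr3R", 200, "copia"),
    Insertion("chrX", 900, "roo"),
]
example_calls = [
    Insertion("chr2L", 1010, "P-element"),  # within 50 bp of a benchmark site
    Insertion("chr2L", 5200, "roo"),        # too far from the benchmark site
    Insertion("chr3R", 190, "copia"),       # within 50 bp of a benchmark site
]

sens, prec = sensitivity_precision(example_calls, example_benchmark)
print(f"sensitivity={sens:.2f} precision={prec:.2f}")
```

In this sketch, sites are matched per chromosome and transposon family, so a call for the wrong family at the right position counts as both a miss and a false positive. A real comparison would also need to account for insertion orientation and breakpoint uncertainty.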

Details

Tool Type:
command-line tool, library
Programming Languages:
Shell, Perl, R
Added:
3/19/2021
Last Updated:
11/24/2024

Publications

Yu T, Huang X, Dou S, Tang X, Luo S, Theurkauf WE, Lu J, Weng Z. A benchmark and an algorithm for detecting germline transposon insertions and measuring de novo transposon insertion frequencies. Nucleic Acids Research. 2021;49(8):e44. doi:10.1093/nar/gkab010. PMID:33511407. PMCID:PMC8096211.

Funding:

  • Chinese National Natural Science Foundation: 31871296
  • National Institutes of Health: HD078253
