TRDistiller

TRDistiller filters protein sequences to identify and enrich candidates containing tandem repeats (TRs) for large-scale proteome analysis.


Key Features:

  • Rapid Filtering Mechanism: Compares composition and order of short strings in adjacent sequence motifs to distinguish TR-containing sequences from non-TR sequences.
  • Efficiency and Sensitivity: Retains 99.2% of TR-containing sequences while discarding up to 22.5% of proteins lacking tandem repeats.
  • Time Efficiency: Provides a rapid solution suitable for extensive proteome datasets compared with other sensitive algorithms that are time-consuming.

Scientific Applications:

  • Proteome Analysis: Filters out non-TR proteins to reduce dataset size and prioritize sequences for subsequent tandem-repeat detection methods.
  • Structural Biology Research: Identifies proteins with periodic sequences that may fold into elongated fibrous structures, supporting study of protein structure–function relationships.

Methodology:

Performs comparative analysis of the composition and order of short adjacent sequence motifs to differentiate sequences with and without tandem repeats, thereby reducing computational load during initial dataset screening.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux, Mac
Added:
8/3/2017
Last Updated:
11/25/2024

Operations

Data Inputs & Outputs

Protein sequence analysis

Publications

Richard FD, Kajava AV. TRDistiller: A rapid filter for enrichment of sequence datasets with proteins containing tandem repeats. Journal of Structural Biology. 2014;186(3):386-391. doi:10.1016/j.jsb.2014.03.013. PMID:24681324.

Documentation

Links