Folddisco

Folddisco detects discontinuous or segmental three-dimensional (3D) protein motifs in large structural databases to identify functionally significant short 3D patterns.


Key Features:

  • Position-Independent Geometric Indexing: Uses an index based on position-independent geometric features, including side-chain orientation, to represent discontinuous 3D motifs.
  • Rarity-Based Scoring System: Implements a rarity-based scoring system to prioritize less common structural patterns that may be functionally significant.
  • Efficiency and Scalability: Indexes 53 million AFDB50 structures into a compact 1.45 terabyte database within 24 hours, achieving roughly an order of magnitude speed improvement over existing methods.
  • Accuracy and Storage Efficiency: Reports higher accuracy in detecting discontinuous motifs and lower storage requirements than other available tools.
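
The first two features can be illustrated with a toy sketch. It is not Folddisco's implementation: the feature definition (residue types, binned Cα-Cα distance, binned angle between Cα→Cβ vectors as a stand-in for side-chain orientation), the bin widths, the log-based rarity weight, and every function name are assumptions chosen for illustration. The sketch only shows why features built from internal geometry do not depend on where a structure sits in space, and how an inverted index can weight rare features more heavily.

```python
import math
from collections import defaultdict
from itertools import combinations

# A residue is (amino_acid, ca_xyz, cb_xyz). Hypothetical toy representation;
# residues without a CB atom (glycine) are not handled here.

def _angle_deg(u, v):
    """Angle between two 3D vectors in degrees."""
    dot = sum(a * b for a, b in zip(u, v))
    cos = dot / (math.hypot(*u) * math.hypot(*v))
    return math.degrees(math.acos(max(-1.0, min(1.0, cos))))

def pair_feature(res_a, res_b, dist_bin=1.0, angle_bin=30.0):
    """Binned key built only from internal geometry, so it is unchanged
    by rotating or translating the whole structure."""
    (aa_a, ca_a, cb_a), (aa_b, ca_b, cb_b) = res_a, res_b
    dist = math.dist(ca_a, ca_b)
    va = [b - a for a, b in zip(ca_a, cb_a)]  # CA->CB direction of residue a
    vb = [b - a for a, b in zip(ca_b, cb_b)]  # CA->CB direction of residue b
    aa_lo, aa_hi = sorted((aa_a, aa_b))       # make the key order-independent
    return (aa_lo, aa_hi, int(dist // dist_bin),
            int(_angle_deg(va, vb) // angle_bin))

def features(residues):
    return {pair_feature(a, b) for a, b in combinations(residues, 2)}

def build_index(db):
    """Inverted index: feature -> set of structure ids containing it."""
    index = defaultdict(set)
    for sid, residues in db.items():
        for feat in features(residues):
            index[feat].add(sid)
    return index

def search(query, index, n_structures):
    """Sum a rarity weight over matched features; rarer features count more."""
    scores = defaultdict(float)
    for feat in features(query):
        hits = index.get(feat, ())
        if hits:
            weight = math.log(1.0 + n_structures / len(hits))
            for sid in hits:
                scores[sid] += weight
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))

def rotate_and_shift(residues, shift=(10.0, -5.0, 3.0)):
    """Rotate 90 degrees about the z axis, then translate."""
    def move(p):
        x, y, z = p
        return (-y + shift[0], x + shift[1], z + shift[2])
    return [(aa, move(ca), move(cb)) for aa, ca, cb in residues]

# Toy three-residue motif with made-up coordinates.
motif = [
    ("H", (0.0, 0.0, 0.0), (1.5, 0.0, 0.0)),
    ("D", (4.0, 3.0, 1.0), (5.0, 3.0, 2.0)),
    ("S", (0.0, 6.5, 0.0), (1.0, 8.5, 0.0)),
]
moved = rotate_and_shift(motif)

db = {
    "A": motif,
    "B": moved,                         # same motif in a different pose
    "C": rotate_and_shift(motif[:2]),   # decoy: only the common H-D pair
    "D": motif[:2],                     # decoy: only the common H-D pair
}
index = build_index(db)
ranked = search(motif, index, len(db))
```

In this toy database the H-D pair occurs in every structure and therefore contributes the smallest weight, while the two pairs involving S occur only in "A" and "B" and dominate the score, so both poses of the full motif rank above the decoys.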

Scientific Applications:

  • Motif Discovery: Identifies functionally important short 3D patterns within protein structures.
  • Protein Function and Interaction Analysis: Supports the study of protein function and interactions in structural biology research.
  • Large-Scale Structural Database Mining: Enables motif discovery across large databases such as AFDB50.

Methodology:

Folddisco builds a position-independent geometric index that incorporates side-chain orientation and ranks candidate motifs with a rarity-based score. With this approach, the 53 million AFDB50 structures are indexed into a compact 1.45 terabyte database within 24 hours.
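
To put the reported figures in perspective, a back-of-the-envelope calculation gives roughly 27 kB of index per structure and a sustained rate of at least about 600 structures per second. The sketch below assumes decimal terabytes (10^12 bytes) and takes the full 24 hours as the build time; both are assumptions, since the entry does not specify them.

```python
# Figures taken from the entry; unit conventions are assumptions.
n_structures = 53_000_000      # AFDB50 structures
index_bytes = 1.45e12          # 1.45 TB, assuming 1 TB = 10**12 bytes
build_seconds = 24 * 3600      # "within 24 hours", used as an upper bound

bytes_per_structure = index_bytes / n_structures
structures_per_second = n_structures / build_seconds  # lower bound on throughput

print(f"{bytes_per_structure / 1e3:.1f} kB per structure")
print(f"{structures_per_second:.0f} structures per second")
```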

Details

License: GPL-3.0
Maturity: Emerging
Cost: Free of charge
Tool Type: command-line tool, web application
Operating Systems: Linux, Mac
Programming Languages: Rust
Added: 11/10/2025
Last Updated: 11/10/2025
