CD-Search
CD-Search identifies conserved structural and functional domains in protein sequences to support annotation, evolutionary analysis, and structural interpretation.
Key Features:
- Domain Detection: CD-Search employs RPS-BLAST and Position Specific Score Matrices (PSSMs) derived from Conserved Domain Database (CDD) alignments to identify conserved domains by aligning queries to domain-model consensus sequences.
- BLAST heuristics: It leverages BLAST heuristics to perform fast searches through the collection of domain models in the CDD.
- Visualization: Results include domain architecture cartoons, pairwise alignments between query and domain models, multiple alignment displays, and links to three-dimensional molecular graphics of known structures within domain families.
- Integration with Entrez: The CDD is indexed as a standalone database within the Entrez system, enabling cross-references to other Entrez resources such as MEDLINE and sequence records.
- Comprehensive Domain Models: The CDD mirrors public collections such as SMART and PFAM and includes NCBI-curated models structured to reflect conserved core substructures and preserved functional sites among family members.
- Automatic BLAST searches: CD-Search is run automatically by default for protein-protein queries submitted to the BLAST service.
Scientific Applications:
- Domain annotation: Identification of conserved domains to annotate protein sequences and locate preserved functional sites.
- Function prediction: Inferring protein function from detected domain architectures and conserved motifs.
- Evolutionary analysis: Assessing evolutionary relationships and domain architecture conservation among protein families.
- Structural interpretation: Linking domain models to known three-dimensional structures to support structural biology studies.
- Cross-database research integration: Combining domain annotations with Entrez and MEDLINE records to connect sequence features with literature and other database entries.
Methodology:
CD-Search uses RPS-BLAST with PSSMs derived from CDD alignments and BLAST heuristics to align query protein sequences to domain-model consensus sequences; the CDD is indexed within Entrez and CD-Search is run by default for protein–protein BLAST queries.
Topics
Details
- Tool Type:
- web application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 3/24/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Marchler-Bauer A, Bryant SH. CD-Search: protein domain annotations on the fly. Nucleic Acids Research. 2004;32(Web Server):W327-W331. doi:10.1093/nar/gkh454. PMID:15215404. PMCID:PMC441592.
Marchler-Bauer A. CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Research. 2003;31(1):383-387. doi:10.1093/nar/gkg087. PMID:12520028. PMCID:PMC165534.