SAPFIR

SAPFIR identifies protein features affected by alternative splicing by mapping InterProScan-predicted domains, motifs, and sites to transcripts in human and mouse to support functional interpretation of isoform diversity.


Key Features:

  • Predictive Analysis: Uses InterProScan to analyze protein sequences for functional domains, motifs, and sites and links these predicted features to specific alternative splicing events in human and mouse.
  • Data Integration: Integrates transcript data with splice site information to present alternative protein features as functions of transcripts and isoforms.
  • Scalability: Processes large sequence datasets and can analyze proteins from a single gene or multiple genes simultaneously for high-throughput studies.
  • Validation: Rediscovered previously confirmed alternative protein domains, supporting the accuracy of its predictions.
  • Comparative Insights: De novo analysis of public datasets reveals conservation of alternative protein domains between human and mouse and indicates enrichment of domain-losing isoforms in genes involved in nervous system processes, regulation of DNA-templated transcription, and aging.

Scientific Applications:

  • Functional interpretation of alternative splicing: Identify which protein domains, motifs, or sites are gained or lost due to alternative splicing to infer potential functional consequences.
  • Comparative genomics: Assess conservation of alternative protein domains between human and mouse to prioritize conserved splicing events.
  • Biological process and disease studies: Investigate splicing-driven loss of functional domains in research areas such as neurobiology, transcription regulation, and aging.

Methodology:

InterProScan is used to predict domains, motifs, and sites from protein sequences, and these predictions are mapped to transcripts using transcript and splice site information; de novo analyses of public datasets are performed for conservation and enrichment studies.

Topics

Details

License:
GPL-3.0
Cost:
Free of charge
Tool Type:
web application
Operating Systems:
Mac, Linux, Windows
Programming Languages:
Python
Added:
9/3/2022
Last Updated:
11/24/2024

Operations

Publications

Zhou D, Tran Y, Abou Elela S, Scott MS. SAPFIR: A webserver for the identification of alternative protein features. BMC Bioinformatics. 2022;23(1). doi:10.1186/s12859-022-04804-w. PMID:35751026. PMCID:PMC9229502.

PMID: 35751026
PMCID: PMC9229502
Funding: - Natural Sciences and Engineering Research Council of Canada: RGPIN-2018-05412

Links