Protein Sequence Logos

Protein Sequence Logos visualizes aligned RNA and protein sequences as sequence and structure logos to quantify per-position information content and, for base-paired RNA, mutual information between paired bases.


Key Features:

  • Sequence Logo Foundation: Implements the sequence logo representation introduced by Schneider and Stephens (1990).
  • Incorporation of Prior Frequencies: Integrates prior frequencies of bases or amino acids to adjust information content calculations using background symbol distributions.
  • Handling Gaps in Alignments: Accommodates gaps within alignments to represent insertions and deletions in evolutionary comparisons.
  • Mutual Information Calculation: Calculates mutual information between paired bases for RNA alignments when base pairings are indicated.
  • Structure Logos for Base-Paired Regions: Generates Structure Logos that display both sequence and structural information for base-paired RNA regions.
  • Information Content Visualization: Draws each character with a height proportional to its frequency at that position, sorts characters from most to least frequent, and sets the total stack height to the position's information content in bits.
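
The stack-height rule above can be illustrated with a minimal sketch. It assumes information content is measured as relative entropy against the background (prior) distribution, which reduces to log2(alphabet size) minus Shannon entropy for a uniform prior, and it assumes gaps are simply skipped. The tool's exact formula, gap treatment, and any small-sample correction are not stated here, so the function name column_stack and its parameters are hypothetical.

```python
import math
from collections import Counter


def column_stack(column, background, gap="-"):
    """Return (info_bits, stack) for one alignment column.

    stack is a list of (symbol, height_bits) pairs sorted from most to
    least frequent; the heights sum to info_bits.
    """
    # Assumption: gap characters are ignored rather than modeled.
    symbols = [c for c in column if c != gap]
    if not symbols:
        return 0.0, []

    n = len(symbols)
    freqs = {s: c / n for s, c in Counter(symbols).items()}

    # Relative entropy (in bits) of observed frequencies vs. the prior.
    info = sum(q * math.log2(q / background[s]) for s, q in freqs.items())

    # Most frequent symbol first; each height is frequency * total height.
    ordered = sorted(freqs.items(), key=lambda item: item[1], reverse=True)
    stack = [(s, q * info) for s, q in ordered]
    return info, stack


if __name__ == "__main__":
    uniform_rna = {"A": 0.25, "C": 0.25, "G": 0.25, "U": 0.25}
    info, stack = column_stack("AAAC", uniform_rna)
    print(round(info, 3), stack)
```

With a uniform RNA background, a fully conserved column reaches the maximum of 2 bits, while a column with all four bases at equal frequency has zero height.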

Scientific Applications:

  • Consensus Sequence Determination: Determine consensus sequences by identifying the most prominent characters at each alignment position.
  • Analysis of Sequence Variability: Highlight positions with high variability to identify potential sites of functional or evolutionary significance.
  • Functional and Structural Insights: Use mutual information and structure logos to infer how sequence variation affects RNA structure and function.
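
As a rough illustration of the first two applications, the sketch below takes the most frequent character in each column as the consensus and uses Shannon entropy as a simple variability score. Both choices are assumptions made for illustration, as is the helper name consensus_and_entropy; the tool itself reports these properties graphically through the logo.

```python
import math
from collections import Counter


def consensus_and_entropy(alignment, gap="-"):
    """Return (consensus_string, per_column_entropy_bits) for an alignment.

    alignment is a list of equal-length strings. Gaps are ignored when
    counting (an assumption); an all-gap column yields the gap character.
    """
    consensus = []
    entropies = []
    for column in zip(*alignment):
        counts = Counter(c for c in column if c != gap)
        n = sum(counts.values())
        if n == 0:
            consensus.append(gap)
            entropies.append(0.0)
            continue
        # Most frequent character at this position.
        consensus.append(counts.most_common(1)[0][0])
        # Shannon entropy in bits: higher means more variable.
        entropies.append(-sum((c / n) * math.log2(c / n) for c in counts.values()))
    return "".join(consensus), entropies


if __name__ == "__main__":
    seqs = ["ACGU", "ACGA", "ACCA"]
    cons, ent = consensus_and_entropy(seqs)
    print(cons, [round(e, 3) for e in ent])
```

Columns with entropy near zero are conserved; columns with high entropy correspond to short stacks in the logo.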

Methodology:

Characters are stacked at each alignment position with heights proportional to symbol frequency and sorted from most to least common; the total stack height equals information content in bits, and mutual information is computed for annotated base-paired RNA positions.
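
The mutual information step can be sketched for a single pair of annotated columns. This example uses the standard definition of mutual information between two columns and drops any sequence with a gap in either column. The exact measure the tool uses for structure logos may differ, so treat the function name mutual_information and the gap handling as assumptions.

```python
import math
from collections import Counter


def mutual_information(col_i, col_j, gap="-"):
    """Mutual information (bits) between two paired alignment columns."""
    # Assumption: sequences with a gap in either column are skipped.
    pairs = [(a, b) for a, b in zip(col_i, col_j) if a != gap and b != gap]
    if not pairs:
        return 0.0

    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)

    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        p_a = left[a] / n
        p_b = right[b] / n
        mi += p_ab * math.log2(p_ab / (p_a * p_b))
    return mi


if __name__ == "__main__":
    # Two columns that covary as G-C and C-G pairs.
    print(mutual_information("GGCC", "CCGG"))
```

Perfectly covarying columns score high, while a pair of fully conserved columns scores zero because conservation alone carries no covariation signal.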

Details

License: Other
Tool Type: Web application
Operating Systems: Linux, Windows, Mac
Added: 12/6/2017
Last Updated: 11/24/2024

Publications

Gorodkin J, Heyer L, Brunak S, Stormo G. Displaying the information contents of structural RNA alignments: the structure logos. Bioinformatics. 1997;13(6):583-586. doi:10.1093/bioinformatics/13.6.583. PMID:9475985.

Schneider TD, Stephens R. Sequence logos: a new way to display consensus sequences. Nucleic Acids Research. 1990;18(20):6097-6100. doi:10.1093/nar/18.20.6097. PMID:2172928. PMCID:PMC332411.

Documentation