RSAT suite

RSAT suite provides computational methods to identify and analyze regulatory signals in non-coding DNA sequences, including motif discovery, genome-scale pattern matching, and statistical assessment of oligonucleotide over-representation.


Key Features:

  • Pattern Discovery and Matching: Implements string-based and matrix-based methods for discovering known and de novo regulatory patterns and detecting over-represented oligonucleotides, with multiple pattern-discovery algorithms.
  • Genome-Scale Analysis: Performs genome-scale pattern matching using a database of over 100 fully sequenced genomes updated from GenBank, NCBI, and Ensembl.
  • Feature Map Drawing: Generates feature maps that annotate positions of identified regulatory patterns.
  • Random Sequence Generation: Produces random sequences using Bernoulli and Markov chain background models for control experiments and statistical assessment.
  • Web Services Integration: Exposes programmatic web services for integration into automated analysis workflows.
  • Phylogenetic Footprint Detection: Provides matrix-based detection of cis-acting elements with adaptive background models and Markov-chain estimation of P-values.

Scientific Applications:

  • Transcription factor binding site identification: Detects DNA binding sites for transcription factors across genomes, including Saccharomyces cerevisiae.
  • Regulatory element prediction: Predicts unknown regulatory elements and motifs from non-coding sequences.
  • Motif validation: Assesses and validates experimentally discovered motifs using statistical significance derived from oligonucleotide frequencies.
  • Comparative regulatory analysis: Supports genome-wide and cross-organism analyses of regulatory sequences using available sequenced genomes.

Methodology:

Performs exhaustive oligonucleotide analysis focused on over-represented sequences in upstream regions of co-regulated genes; applies string-based and matrix-based pattern detection; uses adaptive background models and Markov-chain estimation of P-values; generates random sequences with Bernoulli and Markov chain models and defines significance from observed oligonucleotide frequencies across non-coding yeast genome sequences.

Topics

Collections

Details

License:
Other
Tool Type:
web application, workflow
Operating Systems:
Linux, Windows, Mac
Programming Languages:
Java, Perl, Python
Added:
2/10/2017
Last Updated:
11/25/2024

Operations

Data Inputs & Outputs

Data retrieval

Inputs

Outputs

Publications

van Helden J, Andr� B, Collado-Vides J. A web site for the computational analysis of yeast regulatory sequences. Yeast. 2000;16(2):177-187. doi:10.1002/(sici)1097-0061(20000130)16:2<177::aid-yea516>3.0.co;2-9. PMID:10641039.

van Helden J. Regulatory Sequence Analysis Tools. Nucleic Acids Research. 2003;31(13):3593-3596. doi:10.1093/nar/gkg567. PMID:12824373. PMCID:PMC168973.

van Helden J, André B, Collado-Vides J. Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies 1 1Edited by G. von Heijne. Journal of Molecular Biology. 1998;281(5):827-842. doi:10.1006/jmbi.1998.1947. PMID:9719638.

Thomas-Chollier M, Sand O, Turatsinze J, Janky R, Defrance M, Vervisch E, Brohee S, van Helden J. RSAT: regulatory sequence analysis tools. Nucleic Acids Research. 2008;36(Web Server):W119-W127. doi:10.1093/nar/gkn304. PMID:18495751. PMCID:PMC2447775.

Documentation

Related Tools

retrieve_matrix
Relation: includes
rsat-position-analysis
Relation: includes
rsat_-_retrieve_sequence
Relation: includes
rsat_peak-motifs
Relation: includes
rsat_var-tools
Relation: includes