RSAT suite
RSAT suite provides computational methods to identify and analyze regulatory signals in non-coding DNA sequences, including motif discovery, genome-scale pattern matching, and statistical assessment of oligonucleotide over-representation.
Key Features:
- Pattern Discovery and Matching: Implements string-based and matrix-based methods for discovering known and de novo regulatory patterns and detecting over-represented oligonucleotides, with multiple pattern-discovery algorithms.
- Genome-Scale Analysis: Performs genome-scale pattern matching using a database of over 100 fully sequenced genomes updated from GenBank, NCBI, and Ensembl.
- Feature Map Drawing: Generates feature maps that annotate positions of identified regulatory patterns.
- Random Sequence Generation: Produces random sequences using Bernoulli and Markov chain background models for control experiments and statistical assessment.
- Web Services Integration: Exposes programmatic web services for integration into automated analysis workflows.
- Phylogenetic Footprint Detection: Provides matrix-based detection of cis-acting elements with adaptive background models and Markov-chain estimation of P-values.
Scientific Applications:
- Transcription factor binding site identification: Detects DNA binding sites for transcription factors across genomes, including Saccharomyces cerevisiae.
- Regulatory element prediction: Predicts unknown regulatory elements and motifs from non-coding sequences.
- Motif validation: Assesses and validates experimentally discovered motifs using statistical significance derived from oligonucleotide frequencies.
- Comparative regulatory analysis: Supports genome-wide and cross-organism analyses of regulatory sequences using available sequenced genomes.
Methodology:
Performs exhaustive oligonucleotide analysis focused on over-represented sequences in upstream regions of co-regulated genes; applies string-based and matrix-based pattern detection; uses adaptive background models and Markov-chain estimation of P-values; generates random sequences with Bernoulli and Markov chain models and defines significance from observed oligonucleotide frequencies across non-coding yeast genome sequences.
Topics
Collections
Details
- License:
- Other
- Tool Type:
- web application, workflow
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- Java, Perl, Python
- Added:
- 2/10/2017
- Last Updated:
- 11/25/2024
Operations
Data Inputs & Outputs
Publications
van Helden J, Andr� B, Collado-Vides J. A web site for the computational analysis of yeast regulatory sequences. Yeast. 2000;16(2):177-187. doi:10.1002/(sici)1097-0061(20000130)16:2<177::aid-yea516>3.0.co;2-9. PMID:10641039.
van Helden J. Regulatory Sequence Analysis Tools. Nucleic Acids Research. 2003;31(13):3593-3596. doi:10.1093/nar/gkg567. PMID:12824373. PMCID:PMC168973.
van Helden J, André B, Collado-Vides J. Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies 1 1Edited by G. von Heijne. Journal of Molecular Biology. 1998;281(5):827-842. doi:10.1006/jmbi.1998.1947. PMID:9719638.
Thomas-Chollier M, Sand O, Turatsinze J, Janky R, Defrance M, Vervisch E, Brohee S, van Helden J. RSAT: regulatory sequence analysis tools. Nucleic Acids Research. 2008;36(Web Server):W119-W127. doi:10.1093/nar/gkn304. PMID:18495751. PMCID:PMC2447775.