CENSOR (EBI)

CENSOR (EBI) screens protein and nucleotide query sequences against a curated reference collection of known repeats. It masks the homologous repetitive regions it finds and produces a report that classifies each identified repeat.


Key Features:

  • Repeat Identification: Screens protein and nucleotide query sequences against a curated reference collection of known repeats to detect homologous repetitive regions.
  • Masking Functionality: Applies designated masking symbols to homologous repeat regions to obscure repetitive elements within sequences.
  • Reporting Capabilities: Generates reports that list and classify every identified repeat.
  • EMBL-EBI Integration: Leverages integrated EMBL-EBI repeat reference collections and bioinformatics resources to support screening and reporting.
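
The masking feature can be illustrated with a minimal Python sketch. It is not CENSOR's code: the function names, the 0-based half-open coordinates, and the default symbols are assumptions made for illustration, and the source does not state which masking symbols CENSOR itself uses. The sketch takes repeat regions that have already been located, merges any that overlap, and overwrites them with a single masking character.

```python
from typing import Iterable, List, Tuple

# Assumption: regions are 0-based, half-open intervals [start, end).
Interval = Tuple[int, int]


def merge_intervals(intervals: Iterable[Interval]) -> List[Interval]:
    """Merge overlapping or touching intervals into a sorted, disjoint list."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


def mask_sequence(sequence: str, intervals: Iterable[Interval], symbol: str = "N") -> str:
    """Return a copy of `sequence` with every position in `intervals` replaced by `symbol`."""
    if len(symbol) != 1:
        raise ValueError("symbol must be a single character")
    chars = list(sequence)
    for start, end in merge_intervals(intervals):
        # Clamp to the sequence bounds so out-of-range regions are ignored safely.
        for i in range(max(start, 0), min(end, len(chars))):
            chars[i] = symbol
    return "".join(chars)


# Hypothetical usage: two overlapping repeat regions in a toy nucleotide query.
masked = mask_sequence("ACGTACGTAC", [(2, 5), (4, 7)])  # -> "ACNNNNNTAC"
```

Masking preserves sequence length, so coordinates in the masked output still line up with the original query. "N" for nucleotides and "X" for proteins are common conventions in masking tools generally; whether they match CENSOR's output is an assumption.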

Scientific Applications:

  • Genomics: Masks repetitive elements to improve analysis of unique regions in genomic sequences.
  • Proteomics: Detects and masks protein sequence repeats to aid proteomic analyses.
  • Comparative studies: Removes confounding repetitive regions to support comparative sequence analyses.
  • Genetic variation research: Helps clarify variant interpretation by masking repeats that can confound analyses.
  • Structural biology: Identifies and masks repeats that may affect structural interpretation of sequences.

Methodology:

CENSOR works in four steps: it screens query sequences against a reference collection of known repeats, identifies the regions homologous to those repeats, applies masking symbols to those regions, and produces a classified repeat report. Integrated EMBL-EBI bioinformatics resources support the screening and reporting steps.
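
The screening and reporting steps described above can be sketched as a toy Python example. This is a sketch under stated assumptions, not CENSOR's method: exact substring matching stands in for the homology search, which the source does not describe, and every name here (`RepeatHit`, `screen`, `format_report`, the toy repeat names and classes, the tab-separated layout) is hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class RepeatHit:
    name: str          # identifier of the matching reference repeat
    repeat_class: str  # classification carried over from the reference entry
    start: int         # 0-based, inclusive
    end: int           # 0-based, exclusive


def screen(query: str, reference: Dict[str, Tuple[str, str]]) -> List[RepeatHit]:
    """Find every occurrence of each reference repeat in `query`.

    `reference` maps repeat name -> (repeat class, repeat sequence).
    Assumption: exact matching replaces a real homology search.
    """
    hits: List[RepeatHit] = []
    for name, (repeat_class, repeat_seq) in reference.items():
        if not repeat_seq:
            continue  # skip empty reference entries
        pos = query.find(repeat_seq)
        while pos != -1:
            hits.append(RepeatHit(name, repeat_class, pos, pos + len(repeat_seq)))
            pos = query.find(repeat_seq, pos + 1)  # allow overlapping matches
    return sorted(hits, key=lambda h: (h.start, h.end, h.name))


def format_report(hits: List[RepeatHit]) -> str:
    """Render hits as a tab-separated report with one header line."""
    lines = ["name\tclass\tstart\tend"]
    for h in hits:
        lines.append(f"{h.name}\t{h.repeat_class}\t{h.start}\t{h.end}")
    return "\n".join(lines)


# Hypothetical reference collection and query, for illustration only.
toy_reference = {
    "toy_repeat_A": ("toy_class_1", "GATTACA"),
    "toy_repeat_B": ("toy_class_2", "CCCC"),
}
toy_hits = screen("TTGATTACAGGCCCCCTT", toy_reference)
print(format_report(toy_hits))
```

Keeping each hit as a record with a name, class, and coordinates separates detection from presentation, so the same hit list could feed both a report and a masking step.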

Details

Maturity: Legacy
Tool Type: API, web application
Operating Systems: Linux, Windows, Mac
Added: 1/29/2015
Last Updated: 11/24/2024

Publications

Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Research. 2022;50(W1):W276-W279. doi:10.1093/nar/gkac240. PMID:35412617. PMCID:PMC9252731.

Funding: EMBL-EBI (824087); BY-COVID (101046203); EarlyCause (848158)

Cook CE, Bergman MT, Finn RD, Cochrane G, Birney E, Apweiler R. The European Bioinformatics Institute in 2016: Data growth and integration. Nucleic Acids Research. 2016;44(D1):D20-D26. doi:10.1093/nar/gkv1352. PMID:26673705. PMCID:PMC4702932.
