CEGMA

CEGMA is a software tool that enables the construction of a reliable set of gene annotations in the absence of experimental data. It uses a computational method that defines a set of conserved protein families occurring in a wide range of eukaryotes and presents a mapping procedure that accurately identifies their exon-intron structures in a novel genomic sequence. The tool includes the use of profile-hidden Markov models to ensure the reliability of the gene structures, allowing researchers to build an initial set of reliable gene annotations in potentially any eukaryotic genome, even those in draft stages.

Topic

Mapping;DNA

Detail

  • Operation: Mapping

  • Software interface: Command-line user interface

  • Language: Perl

  • License: GNU General Public License, version 2

  • Cost: Free

  • Version name: 2.5

  • Credit: National Human Genome Research Institute (

  • Input: -

  • Output: -

  • Contact: korflab@ucdavis.edu

  • Collection: -

  • Maturity: Stable

Publications

  • CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes.
  • Parra G, et al. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. 2007; 23:1061-7. doi: 10.1093/bioinformatics/btm071
  • https://doi.org/10.1093/bioinformatics/btm071
  • PMID: 17332020
  • PMC: -

Download and documentation


< Back to DB search