SNAP

SNAP predicts gene structures in genomic sequences using a Semi-Hidden Markov Model (Semi-HMM) framework to enable computational gene prediction in genomes with limited experimental data.


Key Features:

  • Semi-Hidden Markov Model (Semi-HMM): Uses a Semi-HMM framework for parsing nucleic acid sequences and modeling gene structure.
  • Nucleic acid sequence parsing: Parses genomic nucleic acid sequences to identify gene features and boundaries.
  • Species-specific parameterization: Emphasizes estimation and use of species-specific parameters to improve prediction accuracy.
  • Bootstrapped parameter estimation: Employs foreign gene finders to bootstrap parameter estimation rather than using them as direct predictors.
  • Adaptability to diverse genomes: Designed to be adaptable to various genomes, including those with limited experimental data.

Scientific Applications:

  • Computational gene prediction: Predicts gene structures in genomic sequences where experimental annotation is sparse or absent.
  • Functional annotation support: Provides predicted gene models for downstream functional annotation of genes.
  • Comparative genomics: Supplies gene predictions for comparative genomics analyses across species.
  • Evolutionary biology: Enables evolutionary studies that rely on inferred gene structures and gene content comparisons.

Methodology:

Employs a Semi-Hidden Markov Model to parse nucleic acid sequences and predict gene structures, and bootstraps species-specific parameter estimation using foreign gene finders rather than treating them as direct predictors.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
C
Added:
8/3/2017
Last Updated:
11/24/2024

Operations

Data Inputs & Outputs

Gene prediction

Publications

Korf I. Gene finding in novel genomes. BMC Bioinformatics. 2004;5(1). doi:10.1186/1471-2105-5-59. PMID:15144565. PMCID:PMC421630.

Documentation

Links