GenSeed
GenSeed performs target-specific DNA assembly using a seed-driven recursive approach to progressively extend initial DNA or protein seed sequences for reconstructing genomic segments and transcripts.
Key Features:
- Implementation: Implemented as a Perl program.
- Seed-driven recursive assembly: Uses a seed-driven recursive strategy that focuses on progressive extension of an initial seed sequence rather than all-against-all comparisons.
- Seed types supported: Accepts predefined seed sequences derived from DNA or protein data, including short sequences from SAGE or proteome projects.
- Target-specific focus: Optimized for assembling specific genomic segments instead of whole genomes or transcriptomes.
- Adaptability to diverse seeds: Handles a diverse array of seed types for different research needs.
Scientific Applications:
- Nuclear gene or segment reconstruction: Reconstruction of nuclear genes or genomic segments from sequencing reads.
- Full-length transcript reconstruction: Assembly of full-length transcripts starting from transcript-derived seeds.
- Extrachromosomal genome reconstruction: Assembly of extrachromosomal genomes using targeted seed extension.
Methodology:
Starting from a predefined DNA or protein seed (including SAGE or proteome-derived short sequences), GenSeed iteratively performs similarity search, read selection, and assembly in recursive cycles to progressively extend and refine the seed-derived assembly.
Topics
Details
- Tool Type:
- command-line tool
- Operating Systems:
- Linux
- Programming Languages:
- Perl
- Added:
- 12/18/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Sobreira TJP, Gruber A. Sequence-specific reconstruction from fragmentary databases using seed sequences: implementation and validation on SAGE, proteome and generic sequencing data. Bioinformatics. 2008;24(15):1676-1680. doi:10.1093/bioinformatics/btn283. PMID:18544546.