Kollector

Kollector performs targeted de novo assembly of gene loci guided by transcript sequences to reconstruct introns and novel genomic regions.


Key Features:

  • Alignment-Free Assembly: Operates without alignment by utilizing thousands of transcript sequences concurrently to guide localized assembly.
  • Scalability: Handles large eukaryotic genomes and is applicable to non-model eukaryotic organisms.
  • Performance Efficiency: Reconstructed 99% of Caenorhabditis elegans and 80% of Homo sapiens transcript targets from whole genome shotgun reads, compared to 86% and 73% for standard de novo assembly, while reducing time and memory requirements.
  • Versatility in Applications: Applicable to diverse analyses that require locus-level reconstruction, including reconstruction of introns and novel sequences within gene loci.

Scientific Applications:

  • Comparative Genomics: Enables reconstruction of gene loci for analysis of genetic variation across species.
  • Cancer Genomics: Facilitates locus-level reconstruction to investigate mutations at specific gene loci relevant to oncogenesis.
  • Gene-Centric Studies: Supports focused reconstruction of genomic regions for detailed gene-by-gene analyses in eukaryotes.

Methodology:

Kollector uses a targeted, alignment-free assembly strategy that leverages thousands of transcript sequences concurrently to guide localized reconstruction of specific genomic regions rather than whole genomes, enabling recovery of introns and novel sequences while reducing computational overhead.

Topics

Details

Tool Type:
command-line tool
Operating Systems:
Linux, Mac
Programming Languages:
Shell
Added:
6/5/2018
Last Updated:
11/25/2024

Operations

Publications

Kucuk E, Chu J, Vandervalk BP, Hammond SA, Warren RL, Birol I. Kollector: transcript-informed, targeted <i>de novo</i> assembly of gene loci. Bioinformatics. 2017;33(12):1782-1788. doi:10.1093/bioinformatics/btx078. PMID:28186221. PMCID:PMC5572715.

Documentation