snpTree
snpTree constructs phylogenetic trees from concatenated single nucleotide polymorphisms (SNPs) identified in whole genome sequencing (WGS) data to support comparative genomics and outbreak investigations.
Key Features:
- Automated SNP identification: Identifies SNPs from raw reads in fastq format using BWA and from assembled contigs in fasta format using Nucmer.
- SNP concatenation: Concatenates identified SNPs based on their genomic positions relative to a reference genome.
- Phylogenetic tree construction: Constructs phylogenetic trees from concatenated SNPs using FastTree with accompanying Perl scripts.
- Data type versatility: Supports analysis of both raw sequencing reads and assembled genomes/contigs.
Scientific Applications:
- Outbreak investigation and epidemiology: Differentiates closely related outbreak isolates with minimal genetic diversity through precise SNP-based phylogenies.
- Method validation: Applied to bacterial WGS datasets including Vibrio cholerae, Staphylococcus aureus CC398, Salmonella Typhimurium, and Mycobacterium tuberculosis, showing concordant results with published analyses for the first three datasets while not replicating extensive SNP filtering reported for M. tuberculosis.
Methodology:
Align fastq reads to reference genomes with BWA and process fasta contigs with Nucmer; concatenate identified SNPs by reference-based genomic positions; construct phylogenetic trees from concatenated SNPs using FastTree with Perl scripts.
Topics
Details
- License:
- Other
- Maturity:
- Emerging
- Cost:
- Free of charge (with restrictions)
- Tool Type:
- web application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 5/4/2015
- Last Updated:
- 11/25/2024
Operations
Publications
Leekitcharoenphon P, Kaas RS, Thomsen MC, Friis C, Rasmussen S, Aarestrup FM. snpTree - a web-server to identify and construct SNP trees from whole genome sequence data. BMC Genomics. 2012;13(Suppl 7):S6. doi:10.1186/1471-2164-13-s7-s6. PMID:23281601. PMCID:PMC3521233.