breseq
breseq detects structural variations and sequence-level mutations in haploid microbial genomes from short-read DNA resequencing data for identification and annotation of genomic rearrangements relative to a reference genome.
Key Features:
- Split-read alignment analysis: Detects mutations by analyzing split-read alignments against a reference genome to identify breakpoints and novel junctions.
- Novel sequence-junction detection: Evaluates and predicts new sequence junctions, including those involving repetitive elements.
- Statistical coverage model: Uses a statistical model assessing read coverage evenness to refine predictions and reduce false positives.
- Integration of junctions and deletions: Integrates novel junction predictions with deletions in chromosomal regions to produce coherent mutation calls.
- Annotation of gene impacts: Provides biologically meaningful descriptions of mutations and their impacts on genes.
- Detection of mobile element events: Identifies mobile genetic element insertions and deletions mediated by these elements, including transposon insertions and large-scale chromosomal changes.
- Validated on Escherichia coli: Demonstrated on simulated Escherichia coli genomes with unique breakpoint sequences and complex mutational events.
- Reference-genome requirement: Operates relative to a closely related reference genome for accurate mutation calling.
- Read-depth sensitivity: Capable of reliable predictions with modest short-read coverage (greater than ~40-fold).
Scientific Applications:
- Microbial epidemiology: Detects structural variants and insertions relevant to pathogen evolution and outbreak investigation.
- Experimental evolution: Identifies complex mutations arising during evolution experiments, including rearrangements and mobile element activity.
- Synthetic biology: Characterizes engineered genomic changes and unintended rearrangements in microbial strains.
- Genetics and mutation discovery: Reanalyzes mutation-accumulation datasets to quantify contributions of structural variations, such as the ~25% contribution of transposon insertions and large-scale changes reported for E. coli K-12.
Methodology:
Analyzes short-read DNA resequencing data using split-read alignments to detect novel sequence junctions, applies a statistical model assessing read coverage evenness to refine calls, and integrates junction predictions with deletion calls to annotate mutation impacts.
Topics
Collections
Details
- Maturity:
- Mature
- Tool Type:
- web application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 12/19/2016
- Last Updated:
- 11/25/2024
Operations
Data Inputs & Outputs
Polymorphism detection
Outputs
Publications
Barrick JE, Colburn G, Deatherage DE, Traverse CC, Strand MD, Borges JJ, Knoester DB, Reba A, Meyer AG. Identifying structural variation in haploid microbial genomes from short-read resequencing data using breseq. BMC Genomics. 2014;15(1):1039. doi:10.1186/1471-2164-15-1039. PMID:25432719. PMCID:PMC4300727.
Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Čech M, Chilton J, Clements D, Coraor N, Eberhard C, Grüning B, Guerler A, Hillman-Jackson J, Von Kuster G, Rasche E, Soranzo N, Turaga N, Taylor J, Nekrutenko A, Goecks J. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Research. 2016;44(W1):W3-W10. doi:10.1093/nar/gkw343. PMID:27137889. PMCID:PMC4987906.
Mareuil F, Doppelt-Azeroual O, Ménager H. A public Galaxy platform at Pasteur used as an execution engine for web services. Unknown Journal. 2017. doi:10.7490/f1000research.1114334.1.