Afann
Afann adjusts alignment-free sequence dissimilarity measures using neural network regression to correct bias and enable accurate comparison of genome sequences or raw sequencing samples without assembly.
Key Features:
- Alignment-Free Comparison: Rapid calculation of dissimilarity measures including d2star, d2shepp, CVtree, Manhattan, Euclidean, and d2 for genome sequences or raw sequencing samples without assembly.
- Bias Adjustment: Correction of biases inherent in alignment-free dissimilarity calculations derived from sequencing data to mitigate overestimation when comparing sequencing samples directly rather than assembled genomes.
- Neural Network Regression: Use of neural network regression to refine and correct calculated dissimilarities for improved accuracy.
- Computational Efficiency: Time-efficient and memory-conservative operation suitable for large-scale sequence comparison.
Scientific Applications:
- Comparative Genomics: Provides bias-adjusted dissimilarity measures to support accurate comparisons between genomes and within populations.
- Microbiome Analysis: Enables analysis of complex microbiome sequencing samples without assembly, reducing computational burden from alignment-based methods.
- Evolutionary Studies: Facilitates exploration of evolutionary relationships and genetic diversity using alignment-free metrics.
Methodology:
Calculate dissimilarity measures from sequencing samples using alignment-free methods, then apply a neural network regression model to adjust those measures and correct biases that cause overestimation relative to genome-based calculations.
Topics
Details
- Programming Languages:
- C++, Python
- Added:
- 1/14/2020
- Last Updated:
- 12/1/2020
Operations
Publications
Tang K, Ren J, Sun F. Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression. Genome Biology. 2019;20(1). doi:10.1186/s13059-019-1872-3. PMID:31801606. PMCID:PMC6891986.