SolexaQA
SolexaQA assesses base-level quality and performs dynamic read trimming to evaluate and improve Illumina sequencing data for downstream genomic and transcriptomic analyses.
Key Features:
- Automated Quality Statistics: Generates detailed quality statistics from base-level quality scores across reads.
- Visual Graphics for Data Quality: Produces visualizations of quality metrics to represent per-base and per-read quality for comparison across datasets, flow cell lanes, and machine runs.
- Dynamic Sequence Trimming: Trims low-quality bases from reads based on individual base quality scores using algorithmic thresholds.
- Standardized Outputs: Produces standardized output files that support direct comparison between flow cell lanes and sequencing runs.
- Diagnostic Information: Provides diagnostic quality metrics to inform decisions about data retention and manipulation for downstream analyses.
Scientific Applications:
- Large-scale genomic studies: Assess and filter Illumina sequencing reads to improve data quality for genome assembly and variant analysis.
- Transcriptomic analyses: Evaluate and trim RNA-seq reads to enhance reliability of expression quantification and downstream analyses.
- DNA and RNA sequencing workflows: Provide quality assessment and trimming prior to downstream bioinformatics processing to reduce artefacts and noise.
Methodology:
Computationally parses Illumina base-level quality scores to generate quality statistics and graphics and applies algorithms to dynamically trim low-quality bases from reads.
Topics
Details
- License:
- GPL-3.0
- Maturity:
- Mature
- Tool Type:
- workflow
- Operating Systems:
- Linux, Mac
- Programming Languages:
- R, Perl
- Added:
- 1/13/2017
- Last Updated:
- 11/25/2024
Operations
Data Inputs & Outputs
Sequence trimming
Publications
Cox MP, Peterson DA, Biggs PJ. SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics. 2010;11(1). doi:10.1186/1471-2105-11-485. PMID:20875133. PMCID:PMC2956736.