SolexaQA

SolexaQA assesses base-level quality and performs dynamic read trimming to evaluate and improve Illumina sequencing data for downstream genomic and transcriptomic analyses.


Key Features:

  • Automated Quality Statistics: Generates detailed quality statistics from base-level quality scores across reads.
  • Visual Graphics for Data Quality: Produces visualizations of quality metrics to represent per-base and per-read quality for comparison across datasets, flow cell lanes, and machine runs.
  • Dynamic Sequence Trimming: Trims low-quality bases from reads based on individual base quality scores using algorithmic thresholds.
  • Standardized Outputs: Produces standardized output files that support direct comparison between flow cell lanes and sequencing runs.
  • Diagnostic Information: Provides diagnostic quality metrics to inform decisions about data retention and manipulation for downstream analyses.

Scientific Applications:

  • Large-scale genomic studies: Assess and filter Illumina sequencing reads to improve data quality for genome assembly and variant analysis.
  • Transcriptomic analyses: Evaluate and trim RNA-seq reads to enhance reliability of expression quantification and downstream analyses.
  • DNA and RNA sequencing workflows: Provide quality assessment and trimming prior to downstream bioinformatics processing to reduce artefacts and noise.

Methodology:

Computationally parses Illumina base-level quality scores to generate quality statistics and graphics and applies algorithms to dynamically trim low-quality bases from reads.

Topics

Details

License:
GPL-3.0
Maturity:
Mature
Tool Type:
workflow
Operating Systems:
Linux, Mac
Programming Languages:
R, Perl
Added:
1/13/2017
Last Updated:
11/25/2024

Operations

Data Inputs & Outputs

Publications

Cox MP, Peterson DA, Biggs PJ. SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics. 2010;11(1). doi:10.1186/1471-2105-11-485. PMID:20875133. PMCID:PMC2956736.

Documentation