FastQC
FastQC performs quality control analysis on high-throughput sequencing FastQ files to identify sequencing errors, biases, adapter contamination, and other metrics relevant to downstream genomic analyses.
Key Features:
- Input format support: Accepts FastQ files produced by high-throughput sequencing workflows.
- Dual-Mode Operation: Supports interactive standalone execution and non-interactive batch operation for integration into automated pipelines.
- Galaxy integration: Executes within the Galaxy platform and integrates via the Galaxy API or the Bioblend library.
- Quality metrics generated: Reports per-base sequence quality, per-sequence quality scores, GC content distribution, overrepresented sequences, and adapter contamination.
- Anomaly detection: Identifies issues such as poor quality scores, adapter contamination, and overrepresented sequences.
- Report generation: Produces comprehensive QC reports summarizing computed metrics and detected anomalies.
Scientific Applications:
- Identifying sequencing errors: Detects anomalies such as low-quality bases, adapter contamination, and overrepresented sequences in sequencing runs.
- Ensuring data integrity: Verifies consistency and reliability of sequencing data prior to downstream analyses like variant calling or transcriptome assembly.
- Optimizing experimental design: Provides metrics that inform improvements to library preparation and sequencing protocols.
Methodology:
FastQC analyzes FastQ files to compute metrics including per-base sequence quality, per-sequence quality scores, GC content distribution, and the presence of overrepresented sequences and adapter contamination, and compiles these results into a comprehensive report.
Topics
Collections
Details
- License:
- GPL-3.0
- Maturity:
- Mature
- Tool Type:
- command-line tool, desktop application
- Operating Systems:
- Linux, Windows
- Programming Languages:
- Java
- Added:
- 1/13/2017
- Last Updated:
- 10/18/2021
Operations
Publications
Mareuil F, Doppelt-Azeroual O, Ménager H. A public Galaxy platform at Pasteur used as an execution engine for web services. Unknown Journal. 2017. doi:10.7490/f1000research.1114334.1.
Documentation
Links
Repository
https://github.com/csf-ngs/fastqcGalaxy service
https://galaxy.pasteur.fr/tool_runner?tool_id=fastqc