FastqPuri
FastqPuri preprocesses RNA-seq and other short-read sequencing data by performing quality control, adapter trimming, contamination removal, and quality filtering on FASTQ files.
Key Features:
- Comprehensive Quality Control and Filtering: Executes adapter trimming, contaminant removal, and read-quality filtering to prepare high-quality reads for transcript or gene quantification.
- Contaminant Detection and Removal: Efficiently removes biological contaminants and adapter sequences using flexible filtering strategies optimized for contaminant sequence size.
- Format Versatility: Processes single-end and paired-end reads from compressed or uncompressed FASTQ files.
Scientific Applications:
- High-Throughput Sequencing Preprocessing: Improves data quality for RNA-seq analysis, genome assembly, and single nucleotide variant (SNV) detection by eliminating low-quality and contaminant reads.
Methodology:
FastqPuri applies integrated quality assessment, adapter trimming, and contaminant filtering algorithms implemented in C and R, using size-aware filtering strategies to optimize speed, memory usage, and preprocessing accuracy for short-read datasets.
Topics
Details
- License:
- GPL-3.0
- Maturity:
- Emerging
- Cost:
- Free of charge
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Mac
- Programming Languages:
- R, C
- Added:
- 6/20/2019
- Last Updated:
- 6/16/2020
Operations
Publications
Pérez-Rubio P, Lottaz C, Engelmann JC. FastqPuri: high-performance preprocessing of RNA-seq data. BMC Bioinformatics. 2019;20(1). doi:10.1186/s12859-019-2799-0. PMID:31053060. PMCID:PMC6500068.