PRINSEQ
PRINSEQ performs quality control and preprocessing of genomic and metagenomic sequence datasets from FASTA (and QUAL) or FASTQ files.
Key Features:
- File format support: Accepts FASTA (and QUAL) and FASTQ input files for sequence processing.
- Filtering: Removes sequences that fail user-defined criteria, excluding low-quality or irrelevant reads.
- Trimming: Trims sequences according to configurable parameters to remove low-quality bases or regions.
- Reformatting: Rewrites sequence records and converts output to the format needed by downstream tools.
- Summary statistics: Computes comprehensive summary statistics and produces tabular and graphical summaries of sequence data.
- Implementation: Written in Perl.
- Customizability: Provides a wide array of configurable options to tailor preprocessing steps.
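The filtering and trimming features above can be illustrated with a minimal Python sketch. It is not PRINSEQ's code and does not use PRINSEQ's options: the function names, the thresholds (`min_q`, `min_len`, `min_mean_q`), and the Phred+33 quality encoding are all assumptions chosen for illustration.

```python
from typing import Iterator, List, Tuple

Record = Tuple[str, str, str]  # (header, sequence, quality string)


def parse_fastq(lines: List[str]) -> Iterator[Record]:
    """Yield (header, sequence, quality) from strict 4-line FASTQ records."""
    for i in range(0, len(lines) - 3, 4):
        header, seq, _plus, qual = (ln.rstrip("\n") for ln in lines[i:i + 4])
        yield header[1:], seq, qual  # drop the leading '@'


def phred_scores(qual: str, offset: int = 33) -> List[int]:
    """Decode a quality string, assuming Phred+33 encoding."""
    return [ord(c) - offset for c in qual]


def trim_right(seq: str, qual: str, min_q: int) -> Tuple[str, str]:
    """Drop bases from the 3' end while their score is below min_q."""
    scores = phred_scores(qual)
    end = len(seq)
    while end > 0 and scores[end - 1] < min_q:
        end -= 1
    return seq[:end], qual[:end]


def passes_filter(seq: str, qual: str, min_len: int, min_mean_q: float) -> bool:
    """Keep a read only if it is long enough and its mean quality is high enough."""
    if not seq or len(seq) < min_len:
        return False
    scores = phred_scores(qual)
    return sum(scores) / len(scores) >= min_mean_q


def preprocess(lines: List[str], min_q: int = 20, min_len: int = 4,
               min_mean_q: float = 25.0) -> List[Record]:
    """Trim each read first, then filter the trimmed result."""
    kept = []
    for header, seq, qual in parse_fastq(lines):
        seq, qual = trim_right(seq, qual, min_q)
        if passes_filter(seq, qual, min_len, min_mean_q):
            kept.append((header, seq, qual))
    return kept


# Hypothetical example reads: 'I' decodes to 40, '5' to 20, '#' to 2.
example = [
    "@read1", "ACGTACGT", "+", "IIIIII##",
    "@read2", "ACGT", "+", "####",
    "@read3", "ACGTACGTAC", "+", "5555555555",
]
kept_reads = preprocess(example)
```

In this sketch, trimming runs before filtering so that the length and mean-quality checks apply to the bases that would actually be written out. The order is a design choice of the example, not a statement about how PRINSEQ orders its steps.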
Scientific Applications:
- Sequencing data quality assessment: Evaluates the integrity of sequence datasets prior to downstream analyses.
- Preprocessing for genomic and metagenomic studies: Refines datasets by removing low-quality or irrelevant sequences to improve accuracy of subsequent analyses.
Methodology:
PRINSEQ reads FASTA (and QUAL) or FASTQ input, applies the configured filtering, trimming, and reformatting steps, and computes summary statistics that it reports as tables and graphs.
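The sketch below shows the kind of per-dataset summary statistics a quality-control step might compute. The metrics (sequence count, length range, GC content, number of N bases) and the function names are assumptions for the example; they are not a list of what PRINSEQ reports.

```python
from statistics import mean
from typing import Dict, List


def summarize(seqs: List[str]) -> Dict[str, float]:
    """Compute simple length and composition statistics for a set of sequences."""
    if not seqs:
        raise ValueError("no sequences to summarize")
    upper = [s.upper() for s in seqs]
    lengths = [len(s) for s in upper]
    total = sum(lengths)
    gc = sum(s.count("G") + s.count("C") for s in upper)
    return {
        "sequences": len(upper),
        "total_bases": total,
        "min_length": min(lengths),
        "max_length": max(lengths),
        "mean_length": mean(lengths),
        "gc_percent": 100.0 * gc / total if total else 0.0,
        "n_bases": sum(s.count("N") for s in upper),
    }


def as_table(stats: Dict[str, float]) -> str:
    """Render the statistics as tab-separated 'name<TAB>value' lines."""
    return "\n".join(f"{name}\t{value}" for name, value in stats.items())


# Hypothetical input sequences, used only to exercise the functions.
stats = summarize(["ACGT", "GGCC", "at"])
table = as_table(stats)
```

A tab-separated table like this is easy to load into a spreadsheet or plotting tool, which is one way tabular and graphical summaries could be derived from the same numbers.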
Details
- License: GPL-3.0
- Maturity: Mature
- Tool Type: web application
- Operating Systems: Linux, Windows, Mac
- Programming Languages: Perl
- Added: 1/13/2017
- Last Updated: 11/25/2024
Operations
- Read pre-processing
Publications
Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27(6):863-864. doi:10.1093/bioinformatics/btr026. PMID:21278185. PMCID:PMC3051327.