PRINSEQ

PRINSEQ performs quality control and preprocessing of genomic and metagenomic sequence datasets from FASTA (and QUAL) or FASTQ files.


Key Features:

  • File format support: Accepts FASTA (and QUAL) and FASTQ input files for sequence processing.
  • Filtering: Removes sequences that fail user-defined criteria, excluding low-quality or irrelevant reads.
  • Trimming: Trims low-quality bases or regions from sequences according to configurable parameters.
  • Reformatting: Converts sequence records to the output format required by downstream tools.
  • Summary statistics: Computes comprehensive summary statistics and produces tabular and graphical summaries of sequence data.
  • Implementation: Implemented in Perl.
  • Customizability: Provides a wide array of configurable options to tailor preprocessing steps.
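The filtering and trimming steps listed above can be illustrated with a minimal Python sketch. This is not PRINSEQ's Perl implementation: the function names (`parse_fastq`, `trim_right`, `passes_filters`, `preprocess`), the thresholds, and the example reads are all hypothetical, and Phred+33 quality encoding is assumed.

```python
from io import StringIO

# Hypothetical example reads; 'I' encodes Phred 40 and '#' encodes Phred 2
# under the assumed Phred+33 encoding.
EXAMPLE_FASTQ = """@read1
ACGTACGTAC
+
IIIIIIII##
@read2
ACGT
+
I#I#
@read3
ACGTACGT
+
########
"""


def parse_fastq(handle):
    """Yield (identifier, sequence, quality string) from 4-line FASTQ records."""
    while True:
        header = handle.readline().rstrip("\n")
        if not header:
            return
        seq = handle.readline().rstrip("\n")
        handle.readline()  # skip the '+' separator line
        qual = handle.readline().rstrip("\n")
        yield header[1:], seq, qual


def phred_scores(qual, offset=33):
    """Convert a quality string to integer Phred scores (assumes Phred+33)."""
    return [ord(ch) - offset for ch in qual]


def trim_right(seq, qual, min_q):
    """Remove bases from the 3' end while their quality is below min_q."""
    scores = phred_scores(qual)
    end = len(seq)
    while end > 0 and scores[end - 1] < min_q:
        end -= 1
    return seq[:end], qual[:end]


def passes_filters(seq, qual, min_len, min_mean_q):
    """Keep a read only if it is long enough and its mean quality is high enough."""
    if not seq or len(seq) < min_len:
        return False
    scores = phred_scores(qual)
    return sum(scores) / len(scores) >= min_mean_q


def preprocess(fastq_text, min_q=20, min_len=5, min_mean_q=25):
    """Trim each read, then filter; thresholds are illustrative defaults."""
    kept = []
    for name, seq, qual in parse_fastq(StringIO(fastq_text)):
        seq, qual = trim_right(seq, qual, min_q)
        if passes_filters(seq, qual, min_len, min_mean_q):
            kept.append((name, seq, qual))
    return kept


kept_reads = preprocess(EXAMPLE_FASTQ)
for name, seq, qual in kept_reads:
    print(f"@{name}\n{seq}\n+\n{qual}")
```

In this sketch trimming runs before filtering, so a read is judged on what remains after its low-quality tail is removed; a real pipeline might order or combine these steps differently.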

Scientific Applications:

  • Sequencing data quality assessment: Evaluates the integrity of sequence datasets prior to downstream analyses.
  • Preprocessing for genomic and metagenomic studies: Refines datasets by removing low-quality or irrelevant sequences to improve accuracy of subsequent analyses.

Methodology:

Filters, trims, and reformats sequences read from FASTA (and QUAL) or FASTQ files, and computes summary statistics that are reported in tabular and graphical form.
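As a rough illustration of the kind of summary statistics described here, the following Python sketch computes sequence count, length statistics, and GC content from FASTA text and prints them as a tab-delimited table. The choice of statistics, the function names (`read_fasta`, `summarize`), and the example sequences are assumptions made for this example and are not taken from PRINSEQ itself.

```python
# Hypothetical example input; sequence "a" spans two lines.
EXAMPLE_FASTA = """>a
ACGT
GG
>b
ATAT
"""


def read_fasta(text):
    """Return a dict mapping each FASTA header to its concatenated sequence."""
    parts = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            parts[name] = []
        elif name is not None:
            parts[name].append(line)
    return {key: "".join(chunks) for key, chunks in parts.items()}


def summarize(sequences):
    """Compute basic length and GC statistics for a list of sequences."""
    if not sequences:
        return {"sequences": 0, "total_bases": 0}
    lengths = [len(s) for s in sequences]
    total = sum(lengths)
    gc = sum(s.upper().count("G") + s.upper().count("C") for s in sequences)
    return {
        "sequences": len(sequences),
        "total_bases": total,
        "min_length": min(lengths),
        "max_length": max(lengths),
        "mean_length": total / len(sequences),
        "gc_percent": 100 * gc / total if total else 0.0,
    }


stats = summarize(list(read_fasta(EXAMPLE_FASTA).values()))

# Tabular output: one statistic per row, tab-delimited.
for key, value in stats.items():
    print(f"{key}\t{value}")
```

A graphical summary, such as a length histogram, could be built from the same `lengths` list with a plotting library; that part is omitted to keep the sketch dependency-free.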

Details

License: GPL-3.0
Maturity: Mature
Tool Type: web application
Operating Systems: Linux, Windows, Mac
Programming Languages: Perl
Added: 1/13/2017
Last Updated: 11/25/2024

Operations

Read pre-processing

Publications

Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27(6):863-864. doi:10.1093/bioinformatics/btr026. PMID:21278185. PMCID:PMC3051327.
