Skewer

Skewer performs adapter trimming of next-generation sequencing (NGS) reads, identifying and removing adapter sequences from Illumina paired-end, Nextera Long Mate-Pair (LMP), small RNA, genome DNA, and transcriptome RNA/cDNA sequencing data.


Key Features:

  • Bit-masked k-difference matching (dynamic programming): Implements a bit-masked k-difference matching algorithm that enumerates candidate adapter alignments within a specified error threshold with expected time complexity O(kn) and space complexity O(m), where k = maximum allowable differences, n = read length, and m = adapter length.
  • Statistical scoring scheme: Applies a statistical scoring scheme during pattern matching to evaluate candidate alignments for precise adapter identification and trimming.
  • Paired-end and mate-pair optimization: Exploits paired-end and mate-pair information with specialized scoring schemes to improve trimming accuracy, including handling junction adapters located internally as in Nextera LMP.
  • Illumina paired-end specialization: Algorithmic design is tailored specifically for processing Illumina paired-end sequencing data.

Scientific Applications:

  • Small RNA sequencing: Trims adapters from short fragments to enable accurate small RNA analysis.
  • Genome DNA sequencing: Removes adapters from fragments shorter than the read length in genome DNA sequencing workflows.
  • Transcriptome RNA/cDNA sequencing: Handles adapter removal for RNA/cDNA fragments shorter than the read length in transcriptome sequencing.
  • Nextera Long Mate-Pair (LMP) protocol: Trims junction adapters positioned in the middle of fragments to restore correct paired reads in Nextera LMP libraries.

Methodology:

Skewer combines a dynamic programming, bit-masked k-difference matching algorithm with a statistical scoring scheme and has been validated on simulated and real datasets from small RNA sequencing, paired-end RNA sequencing, and Nextera LMP sequencing.

Topics

Details

Maturity:
Mature
Tool Type:
command-line tool
Operating Systems:
Linux
Programming Languages:
C++
Added:
1/13/2017
Last Updated:
11/25/2024

Operations

Publications

Jiang H, Lei R, Ding S, Zhu S. Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads. BMC Bioinformatics. 2014;15(1). doi:10.1186/1471-2105-15-182. PMID:24925680. PMCID:PMC4074385.

Documentation