WEP

WEP performs comprehensive Whole Exome Sequencing (WES) analysis of Next Generation Sequencing (NGS) data, producing quality-controlled alignments, SNP and indel calls, and annotated variants for exome-focused studies.


Key Features:

  • Complete WES Pipeline: Provides an integrated workflow covering initial quality checks through final variant annotation.
  • Input Verification and Quality Control: Implements trimming and filtering processes to ensure integrity and quality of input data.
  • Gapped Alignment: Aligns sequencing reads to reference genomes while accommodating gaps in alignments.
  • BAM File Handling: Converts, sorts, and indexes BAM files for downstream processing.
  • Duplicate Removal: Eliminates redundant reads to improve accuracy of variant calling.
  • Indel Optimization and Quality Score Recalibration: Refines alignments around insertion/deletion sites and recalibrates base quality scores.
  • SNP and Indel Detection: Identifies single nucleotide polymorphisms (SNPs) and insertion/deletion polymorphisms (indels).
  • Variant Annotation: Provides detailed annotations to contextualize genetic variants within biological frameworks.
  • Customizable Data Filtering: Applies user-definable thresholds for variant filtering with default values based on recent literature.
  • Support for Paired and Single-End Sequencing Data: Processes both paired-end and single-end sequencing reads.
  • Cross-Linking and Statistical Analysis via Custom Databases: Stores results in custom databases enabling cross-linking, intersections, and statistical analyses.

Scientific Applications:

  • High-penetrance Variant Analysis: Facilitates identification and interpretation of high-penetrance allelic variations and their phenotypic implications.
  • Exome-focused Eukaryotic Genomics: Enables extraction of biological insights from exonic regions of eukaryotic genomes.

Methodology:

Computational methods explicitly include trimming and filtering for quality control, gapped alignment to reference genomes, BAM conversion/sorting/indexing, duplicate removal, indel optimization, base quality score recalibration, SNP and indel calling, variant annotation, and storage in custom databases supporting cross-linking and statistical analyses; the tool leverages state-of-the-art algorithms that are regularly updated and tested to maximize variant quality while minimizing artifacts and false positives.

Topics

Details

Maturity:
Mature
Cost:
Free of charge (with restrictions)
Tool Type:
web application
Operating Systems:
Linux, Windows, Mac
Added:
1/22/2015
Last Updated:
11/25/2024

Operations

Data Inputs & Outputs

Sequence analysis

Publications

D'Antonio M, D'Onorio De Meo P, Paoletti D, Elmi B, Pallocca M, Sanna N, Picardi E, Pesole G, Castrignanò T. WEP: a high-performance analysis pipeline for whole-exome data. BMC Bioinformatics. 2013;14(S7). doi:10.1186/1471-2105-14-s7-s11. PMID:23815231. PMCID:PMC3633005.

Documentation