VDJPipe
VDJPipe performs pre-processing of high-throughput immune repertoire sequencing data to generate high-quality input for downstream analysis of B-cell and T-cell receptor repertoires.
Key Features:
- Comprehensive pre-processing tasks: Computes base composition and read quality statistics, and performs quality filtering, homopolymer filtering, length and nucleotide filtering, merging of paired-end reads, barcode demultiplexing, 5' and 3' PCR primer matching, and collapsing of duplicate reads.
- Single-pass processing: Executes the pre-processing tasks in a single pass over input data files.
- Pipeline approach: Uses a sequential workflow where the output from each processing step is automatically fed into the next.
- Complex barcoding schemes: Accommodates complex barcode demultiplexing schemes commonly used in immunosequencing experiments.
- Computational efficiency: Benchmarked against pRESTO and reported to require less than 10% of pRESTO's run time on comparable datasets.
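To make a few of the filtering tasks listed above concrete, the following is a minimal Python sketch of length, mean-quality, and homopolymer filters applied to a single read. It is an illustration only and does not reflect VDJPipe's C++ implementation or its configuration format. The function names, the default thresholds, and the Phred+33 quality encoding are all assumptions chosen for the example.

```python
import re


def mean_phred(quality: str, offset: int = 33) -> float:
    """Average Phred score of a FASTQ quality string (Phred+33 is assumed)."""
    if not quality:
        return 0.0
    return sum(ord(ch) - offset for ch in quality) / len(quality)


def has_long_homopolymer(sequence: str, max_run: int) -> bool:
    """Return True if any base repeats more than max_run times in a row."""
    # One base followed by at least max_run copies of itself,
    # i.e. a run of max_run + 1 or more identical bases.
    pattern = r"([ACGTN])\1{%d,}" % max_run
    return re.search(pattern, sequence.upper()) is not None


def passes_filters(
    sequence: str,
    quality: str,
    min_length: int = 20,            # hypothetical threshold
    min_mean_quality: float = 30.0,  # hypothetical threshold
    max_homopolymer: int = 10,       # hypothetical threshold
) -> bool:
    """Apply length, mean-quality, and homopolymer filters to one read."""
    if len(sequence) < min_length:
        return False
    if mean_phred(quality) < min_mean_quality:
        return False
    if has_long_homopolymer(sequence, max_homopolymer):
        return False
    return True
```

A read is kept only if it passes every check; the thresholds would normally be tuned to the sequencing platform and library design.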
Scientific Applications:
- Immune repertoire sequencing: Pre-processing of large-scale immune repertoire sequencing datasets to produce high-quality reads for downstream analysis.
- B-cell and T-cell receptor repertoire analysis: Prepares input for analyses of B-cell receptor (BCR) and T-cell receptor (TCR) repertoires.
- Immunology and translational research: Supports studies of immune responses, vaccine development, and immunotherapy by supplying processed sequencing data.
Methodology:
Performs the listed pre-processing tasks in a single pass over input files within a sequential pipeline that chains each step's output into the next.
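The single-pass, chained design described here can be sketched with Python generators, where each step consumes reads from the previous step one at a time, so the input is traversed only once. This is a conceptual illustration under stated assumptions, not VDJPipe's actual code: the step names, the `(identifier, sequence, quality)` read layout, the thresholds, and the Phred+33 encoding are all hypothetical.

```python
from collections import Counter
from typing import Iterable, Iterator, Tuple

# Hypothetical read layout: (identifier, sequence, quality string)
Read = Tuple[str, str, str]


def length_filter(reads: Iterable[Read], min_length: int) -> Iterator[Read]:
    """Pass along only reads whose sequence is at least min_length bases."""
    for read in reads:
        if len(read[1]) >= min_length:
            yield read


def quality_filter(
    reads: Iterable[Read], min_mean_quality: float, offset: int = 33
) -> Iterator[Read]:
    """Pass along only reads whose mean Phred score meets the threshold."""
    for read in reads:
        quality = read[2]
        if not quality:
            continue
        mean_score = sum(ord(ch) - offset for ch in quality) / len(quality)
        if mean_score >= min_mean_quality:
            yield read


def collapse_duplicates(reads: Iterable[Read]) -> Counter:
    """Final step: count how many surviving reads share each sequence."""
    counts: Counter = Counter()
    for _, sequence, _ in reads:
        counts[sequence] += 1
    return counts


def run_pipeline(reads: Iterable[Read]) -> Counter:
    """Chain the steps so each read flows through all of them in one pass."""
    stage = length_filter(reads, min_length=8)
    stage = quality_filter(stage, min_mean_quality=30.0)
    return collapse_duplicates(stage)


example_reads = [
    ("r1", "ACGTACGTAC", "IIIIIIIIII"),
    ("r2", "ACGTACGTAC", "IIIIIIIIII"),  # duplicate of r1
    ("r3", "ACGT", "IIII"),              # too short
    ("r4", "TTGCATTGCA", "!!!!!!!!!!"),  # low quality
]
unique_counts = run_pipeline(iter(example_reads))
```

Because the filters are lazy, no intermediate file or list is written between steps; only the final duplicate-collapsing step accumulates state.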
Details
- Tool Type: command-line tool
- Operating Systems: Linux
- Programming Languages: C++
- Added: 7/28/2018
- Last Updated: 11/25/2024
Publications
Christley S, Levin MK, Toby IT, Fonner JM, Monson NL, Rounds WH, Rubelt F, Scarborough W, Scheuermann RH, Cowell LG. VDJPipe: a pipelined tool for pre-processing immune repertoire sequencing data. BMC Bioinformatics. 2017;18(1). doi:10.1186/s12859-017-1853-z. PMID:29020925. PMCID:PMC5637252.
Funding:
- National Institute of Allergy and Infectious Diseases: 4R01AI097403-05