cshl_princeton_fastx_barcode_splitter
cshl_princeton_fastx_barcode_splitter splits FASTQ and FASTA sequencing reads by barcode to organize multiplexed NGS data for analysis of DNA- and RNA-based experiments.
Key Features:
- Barcode-Based Splitting: Uses barcodes embedded within sequencing reads to segregate reads into distinct output files.
- Support for FASTQ/FASTA Formats: Accepts FASTQ and FASTA input formats for nucleotide sequencing reads.
Scientific Applications:
- NGS Data Analysis: Enables separation of sequencing reads by barcode to support NGS studies, including analysis of bacterial RNA polymerase promoter melting activity across numerous DNA sequence variants.
- Facilitation of In Vitro Experiments: Allows parallel handling of thousands of sequences to accumulate data on sequence-dependent behavior of proteins acting on DNA.
Methodology:
Leverages barcodes as the criterion for splitting sequencing reads into separate files; automates data organization to support downstream analyses.
Topics
Collections
Details
- Maturity:
- Mature
- Tool Type:
- web application
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- Perl
- Added:
- 12/19/2016
- Last Updated:
- 11/24/2024
Operations
Publications
Heyduk T, Heyduk E. Next Generation Sequencing-based analysis of RNA polymerase functions. Methods. 2015;86:37-44. doi:10.1016/j.ymeth.2015.04.030. PMID:25937393.
Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Čech M, Chilton J, Clements D, Coraor N, Eberhard C, Grüning B, Guerler A, Hillman-Jackson J, Von Kuster G, Rasche E, Soranzo N, Turaga N, Taylor J, Nekrutenko A, Goecks J. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Research. 2016;44(W1):W3-W10. doi:10.1093/nar/gkw343. PMID:27137889. PMCID:PMC4987906.
Mareuil F, Doppelt-Azeroual O, Ménager H. A public Galaxy platform at Pasteur used as an execution engine for web services. Unknown Journal. 2017. doi:10.7490/f1000research.1114334.1.