cshl_fastx_artifacts_filter
cshl_fastx_artifacts_filter filters sequencing reads that contain sequences with all but three identical bases to remove common high-throughput sequencing artifacts and improve the quality of downstream analyses.
Key Features:
- Artifact Identification: Identifies and filters reads in which all but three bases are identical to remove likely sequencing artifacts.
- Integration with Galaxy Platform: Operates within the Galaxy platform to be executed as part of reproducible computational workflows.
Scientific Applications:
- Quality Control in Sequencing Projects: Removes specific artifact-containing reads to improve data quality for downstream genomic analyses.
- Metagenomic and Phylogenetic Analyses: Supports metagenomic and phylogenetic studies by filtering artifact reads prior to analysis and can be used alongside services such as MetaGenSense and NGphylogeny.fr.
Methodology:
Operates within the Galaxy framework using the Galaxy execution engine, communicates with Galaxy via the Galaxy API or the Bioblend library, and runs on the Institut Pasteur cluster.
Topics
Collections
Details
- Maturity:
- Mature
- Tool Type:
- web application
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- Perl
- Added:
- 12/19/2016
- Last Updated:
- 4/20/2021
Operations
Publications
Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Čech M, Chilton J, Clements D, Coraor N, Eberhard C, Grüning B, Guerler A, Hillman-Jackson J, Von Kuster G, Rasche E, Soranzo N, Turaga N, Taylor J, Nekrutenko A, Goecks J. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Research. 2016;44(W1):W3-W10. doi:10.1093/nar/gkw343. PMID:27137889. PMCID:PMC4987906.
Mareuil F, Doppelt-Azeroual O, Ménager H. A public Galaxy platform at Pasteur used as an execution engine for web services. Unknown Journal. 2017. doi:10.7490/f1000research.1114334.1.