cshl_fastx_artifacts_filter

cshl_fastx_artifacts_filter filters sequencing reads that contain sequences with all but three identical bases to remove common high-throughput sequencing artifacts and improve the quality of downstream analyses.


Key Features:

  • Artifact Identification: Identifies and filters reads in which all but three bases are identical to remove likely sequencing artifacts.
  • Integration with Galaxy Platform: Operates within the Galaxy platform to be executed as part of reproducible computational workflows.

Scientific Applications:

  • Quality Control in Sequencing Projects: Removes specific artifact-containing reads to improve data quality for downstream genomic analyses.
  • Metagenomic and Phylogenetic Analyses: Supports metagenomic and phylogenetic studies by filtering artifact reads prior to analysis and can be used alongside services such as MetaGenSense and NGphylogeny.fr.

Methodology:

Operates within the Galaxy framework using the Galaxy execution engine, communicates with Galaxy via the Galaxy API or the Bioblend library, and runs on the Institut Pasteur cluster.

Topics

Collections

Details

Maturity:
Mature
Tool Type:
web application
Operating Systems:
Linux, Windows, Mac
Programming Languages:
Perl
Added:
12/19/2016
Last Updated:
4/20/2021

Operations

Publications

Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Čech M, Chilton J, Clements D, Coraor N, Eberhard C, Grüning B, Guerler A, Hillman-Jackson J, Von Kuster G, Rasche E, Soranzo N, Turaga N, Taylor J, Nekrutenko A, Goecks J. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Research. 2016;44(W1):W3-W10. doi:10.1093/nar/gkw343. PMID:27137889. PMCID:PMC4987906.

Mareuil F, Doppelt-Azeroual O, Ménager H. A public Galaxy platform at Pasteur used as an execution engine for web services. Unknown Journal. 2017. doi:10.7490/f1000research.1114334.1.

Documentation

Links

Related Tools

fastx-toolkit
Relation: includedIn