CloudBurst
CloudBurst is a parallel read-mapping algorithm designed to map next-generation sequencing data to reference genomes, with the aim of facilitating SNP discovery, genotyping, and personal genomics. It is a MapReduce-based tool, which allows parallelization and scaling to larger compute resources. CloudBurst's performance was tested and showed near-linear speedup with the number of processors used, making it faster than the single-processor read-mapping algorithm RMAP, and reduced the running time from hours to minutes for mapping millions of short reads to the human genome.
Topic
Genotype and phenotype;Personalised medicine
Detail
Operation: Read mapping
Software interface: Command-line user interface
Language: Java
License: -
Cost: Free
Version name: 1.1.0
Credit: National Institutes of Health, Department of Homeland Security.
Input: -
Output: -
Contact: Michael Schatz mschatz@umiacs.umd.edu
Collection: -
Maturity: Legacy
Publications
- CloudBurst: highly sensitive read mapping with MapReduce.
- Schatz MC. CloudBurst: highly sensitive read mapping with MapReduce. CloudBurst: highly sensitive read mapping with MapReduce. 2009; 25:1363-9. doi: 10.1093/bioinformatics/btp236
- https://doi.org/10.1093/bioinformatics/btp236
- PMID: 19357099
- PMC: PMC2682523
Download and documentation
Source: https://sourceforge.net/projects/cloudburst-bio/files/cloudburst/
Documentation: https://sourceforge.net/p/cloudburst-bio/wiki/CloudBurst/
Data: https://sourceforge.net/projects/cloudburst-bio/files/cloudburst-data/
< Back to DB search