CloudBurst

CloudBurst is a parallel read-mapping algorithm designed to map next-generation sequencing data to reference genomes, with the aim of facilitating SNP discovery, genotyping, and personal genomics. It is a MapReduce-based tool, which allows parallelization and scaling to larger compute resources. CloudBurst's performance was tested and showed near-linear speedup with the number of processors used, making it faster than the single-processor read-mapping algorithm RMAP, and reduced the running time from hours to minutes for mapping millions of short reads to the human genome.

Topic

Genotype and phenotype;Personalised medicine

Detail

Operation: Read mapping
Software interface: Command-line user interface
Language: Java
License: -
Cost: Free
Version name: 1.1.0
Credit: National Institutes of Health, Department of Homeland Security.
Input: -
Output: -
Contact: Michael Schatz mschatz@umiacs.umd.edu
Collection: -
Maturity: Legacy

Publications

CloudBurst: highly sensitive read mapping with MapReduce.
Schatz MC. CloudBurst: highly sensitive read mapping with MapReduce. CloudBurst: highly sensitive read mapping with MapReduce. 2009; 25:1363-9. doi: 10.1093/bioinformatics/btp236
https://doi.org/10.1093/bioinformatics/btp236
PMID: 19357099
PMC: PMC2682523

Download and documentation

Source: https://sourceforge.net/projects/cloudburst-bio/files/cloudburst/
Documentation: https://sourceforge.net/p/cloudburst-bio/wiki/CloudBurst/
Home page: https://sourceforge.net/projects/cloudburst-bio/
Data: https://sourceforge.net/projects/cloudburst-bio/files/cloudburst-data/

< Back to DB search