CloudBurst

CloudBurst is a parallel read-mapping algorithm for mapping next-generation sequencing data to reference genomes, intended to support SNP discovery, genotyping, and personal genomics. It is built on MapReduce, which lets it run in parallel and scale to larger compute resources. In testing, CloudBurst showed near-linear speedup with the number of processors used, ran faster than the single-processor read-mapping program RMAP, and reduced the running time for mapping millions of short reads to the human genome from hours to minutes.
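
To illustrate how read mapping can be phrased as a MapReduce job, here is a minimal in-memory sketch in Python. It is not CloudBurst's code (CloudBurst itself is written in Java), and it makes several simplifying assumptions: the function names, the seed length, and the choice to seed each read only by its first k-mer are all hypothetical, and alignments allow mismatches only (no gaps). The map step emits k-mer seeds from the reference and the reads, the shuffle step groups records that share a seed, and the reduce step extends each shared seed into a full-length comparison.

```python
from collections import defaultdict

SEED_LEN = 4  # hypothetical seed length, chosen only for this toy example


def map_phase(reference, reads, seed_len=SEED_LEN):
    """Emit (seed, record) pairs: every reference k-mer, plus the first k-mer of each read."""
    for pos in range(len(reference) - seed_len + 1):
        yield reference[pos:pos + seed_len], ("ref", pos)
    for read_id, read in reads.items():
        yield read[:seed_len], ("read", read_id)


def shuffle(pairs):
    """Group emitted values by key, as a MapReduce framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups, reference, reads, max_mismatches=1):
    """For each shared seed, compare the full read to the reference window and count mismatches."""
    hits = []
    for records in groups.values():
        ref_positions = [v for tag, v in records if tag == "ref"]
        read_ids = [v for tag, v in records if tag == "read"]
        for read_id in read_ids:
            read = reads[read_id]
            for pos in ref_positions:
                window = reference[pos:pos + len(read)]
                if len(window) < len(read):
                    continue  # read would run off the end of the reference
                mismatches = sum(a != b for a, b in zip(read, window))
                if mismatches <= max_mismatches:
                    hits.append((read_id, pos, mismatches))
    return sorted(hits)


def map_reads(reference, reads, max_mismatches=1):
    """Run the three toy phases in sequence and return (read_id, position, mismatches) tuples."""
    pairs = map_phase(reference, reads)
    return reduce_phase(shuffle(pairs), reference, reads, max_mismatches)


if __name__ == "__main__":
    reference = "ACGTACGTTAGC"
    reads = {"r1": "ACGTTA", "r2": "GTTAGG"}
    print(map_reads(reference, reads))
```

Because each seed group is processed independently in the reduce step, the groups could in principle be handed to separate workers, which is the property that makes this style of algorithm a natural fit for scaling across processors. A real mapper would also need seeds from more than one position in each read, since a mismatch inside the single seed used here would cause the alignment to be missed.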

Topic

Genotype and phenotype; Personalised medicine

Detail

  • Operation: Read mapping

  • Software interface: Command-line user interface

  • Language: Java

  • License: -

  • Cost: Free

  • Version name: 1.1.0

  • Credit: National Institutes of Health, Department of Homeland Security

  • Input: -

  • Output: -

  • Contact: Michael Schatz mschatz@umiacs.umd.edu

  • Collection: -

  • Maturity: Legacy

Publications

Download and documentation

