Harman

Harman is a software tool designed to address the pervasive issue of batch effects in high-throughput genomic datasets. Batch are effects that can undermine scientific utility, ranging from reducing the power of statistical tests to rendering datasets useless. Harman introduces a technique that maximizes the removal of batch noise while minimizing the loss of biologically meaningful signals, with the user setting the acceptable risk level. Through analyzing three public datasets, Harman demonstrated superior performance in removing batch noise and preserving signal compared to existing methods. It offers consistent noise suppression and signal preservation across various studies, making it highly advantageous for meta-analyses and data integration.

Topic

Data quality management;Statistics and probability

Detail

  • Operation: Data handling;Principal component plotting

  • Software interface: Command-line user interface,Library

  • Language: R

  • License: The GNU General Public License v3.0

  • Cost: Free

  • Version name: 1.30.0

  • Credit: CSIRO Transformational Biology Capability Platform, CSIRO Nanosafety Theme.

  • Input: -

  • Output: -

  • Contact: Jason Ross jason.ross@csiro.au

  • Collection: -

  • Maturity: Stable

Publications

  • Risk-conscious correction of batch effects: maximising information extraction from high-throughput genomic datasets.
  • Oytam Y, et al. Risk-conscious correction of batch effects: maximising information extraction from high-throughput genomic datasets. Risk-conscious correction of batch effects: maximising information extraction from high-throughput genomic datasets. 2016; 17:332. doi: 10.1186/s12859-016-1212-5
  • https://doi.org/10.1186/s12859-016-1212-5
  • PMID: 27585881
  • PMC: PMC5009651

Download and documentation


< Back to DB search