Bio-DIA

Bio-DIA integrates heterogeneous biological datasets and algorithms into structured, reproducible workflows for bioinformatics analyses.


Key Features:

  • Workflow process construction: Workflows are specified using XML files to produce structured and reproducible data-analysis processes.
  • Backend processing: Django and Apache Spark handle data operations and large-scale dataset processing across multiple formats.
  • Collaboration and sharing: The platform enables sharing of data, scripts, and results among research groups to support collaborative analyses.
  • Reusability of results: Analysis outputs and intermediate information can be reused in subsequent workflows to build upon prior work.

Scientific Applications:

  • Genomic analysis: Integration of heterogeneous data and algorithms to support genomic studies.
  • Proteomics: Facilitates integration and analysis of proteomics datasets and associated algorithms.
  • Multi-omics and large-scale datasets: Supports integration and processing of other omics data and big-data formats for large-scale biological studies.

Methodology:

Methodology uses structured workflows defined in XML files with a backend composed of Django and Apache Spark for data processing and algorithm execution.

Topics

Details

Programming Languages:
Python
Added:
1/14/2020
Last Updated:
12/9/2020

Operations

Publications

Dantas T, da Silva VL, Fonseca AF, Morais DAA, Signoretti A, Blanco W. Bio-DIA: A web-based tool for data and algorithms integration. Unknown Journal. 2019. doi:10.1101/2019.12.13.875666.