Bio-DIA
Bio-DIA integrates heterogeneous biological datasets and algorithms into structured, reproducible workflows for bioinformatics analyses.
Key Features:
- Workflow process construction: Workflows are specified using XML files to produce structured and reproducible data-analysis processes.
- Backend processing: Django and Apache Spark handle data operations and large-scale dataset processing across multiple formats.
- Collaboration and sharing: The platform enables sharing of data, scripts, and results among research groups to support collaborative analyses.
- Reusability of results: Analysis outputs and intermediate information can be reused in subsequent workflows to build upon prior work.
Scientific Applications:
- Genomic analysis: Integration of heterogeneous data and algorithms to support genomic studies.
- Proteomics: Facilitates integration and analysis of proteomics datasets and associated algorithms.
- Multi-omics and large-scale datasets: Supports integration and processing of other omics data and big-data formats for large-scale biological studies.
Methodology:
Methodology uses structured workflows defined in XML files with a backend composed of Django and Apache Spark for data processing and algorithm execution.
Topics
Details
- Programming Languages:
- Python
- Added:
- 1/14/2020
- Last Updated:
- 12/9/2020
Operations
Publications
Dantas T, da Silva VL, Fonseca AF, Morais DAA, Signoretti A, Blanco W. Bio-DIA: A web-based tool for data and algorithms integration. Unknown Journal. 2019. doi:10.1101/2019.12.13.875666.