BioProject

BioProject organizes and links project-level metadata for biological research to facilitate discovery and interoperability across NCBI, EBI, and DDBJ archival repositories.


Key Features:

  • Centralized Metadata Repository: Captures descriptive information about research projects that generate submissions to NCBI, EBI, and DDBJ, enabling aggregation of diverse data types within single initiatives or consortia.
  • Integration with Archival Databases: Interconnects BioProject records with corresponding data stored in NCBI, EBI, and DDBJ archival repositories to support cross-database discovery and linking of related datasets.
  • BioSample Database Complementarity: Associates project-level records with the BioSample database to capture sample-specific descriptive information and ensure metadata coverage at both project and sample levels.
  • Enhanced Data Querying and Integration: Organizes and classifies project metadata to improve querying, locating, integration, and interpretation of datasets across NCBI, EBI, and DDBJ archival repositories.

Scientific Applications:

  • Multi-omics integration: Facilitates integration of genomics, proteomics, metabolomics, and other omics datasets by linking project-level metadata with archival data across repositories.
  • Data discovery and reuse: Enables discovery, aggregation, and reuse of datasets across NCBI, EBI, and DDBJ for secondary analyses and comparative studies.
  • Contextualized sample analysis: Supports studies requiring detailed biological context by connecting project-level information with sample-specific metadata in BioSample.
  • Consortium and large-scale project aggregation: Aggregates metadata from multi-institution initiatives to provide unified views of datasets generated by consortia.

Methodology:

Metadata are systematically collected, consistently formatted to promote interoperability, and linked to corresponding archival records across NCBI, EBI, and DDBJ.

Topics

Details

Tool Type:
web application
Operating Systems:
Linux, Windows, Mac
Programming Languages:
SQL
Added:
3/30/2017
Last Updated:
4/15/2021

Operations

Publications

Barrett T, et al. BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res. 2012; 40:D57-63. doi: 10.1093/nar/gkr1163

PMID: 22139929

Documentation