biomaRt

biomaRt provides programmatic access to BioMart databases to retrieve and integrate genomic and proteomic annotations, SNP information, Gene Ontology terms, and OMIM annotations for bioinformatic analyses.


Key Features:

  • Integration with Bioconductor: Distributed as the biomaRt package for R through Bioconductor, enabling annotation of genes with symbols, chromosomal coordinates, Gene Ontology terms, and OMIM entries.
  • Data Retrieval Capabilities: Retrieves genomic sequences and single nucleotide polymorphism (SNP) information from BioMart databases such as Ensembl. Current releases query the BioMart web service; the original publication (Durinck et al., 2005) also describes direct MySQL queries.
  • Molecule Mapping and Data Integration: Supports mapping across gene-to-transcript-to-protein relationships and integration of experimental datasets via BioMart web services, enabling linkage of DNA sequence variation with mRNA and protein abundance.
  • Dynamic Database Content: Accommodates evolving probe–target relationships in public databases (e.g., Ensembl) within a programmable and reproducible framework for data integration.
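
The annotation features listed above can be sketched with the package's `useEnsembl()` and `getBM()` functions. This is a minimal sketch, not a definitive workflow: the helper name `annotate_genes`, the choice of the human dataset `hsapiens_gene_ensembl`, and the two example Ensembl gene IDs are assumptions made for illustration. The attribute names are those commonly exposed by the Ensembl genes mart and should be checked with `listAttributes()` against the release being queried.

```r
# Attributes to retrieve: gene symbol, chromosomal coordinates, and GO terms.
# These names are assumed from the Ensembl genes mart; verify them with
# biomaRt::listAttributes(mart) for the release you query.
annotation_attributes <- c(
  "ensembl_gene_id", "hgnc_symbol",
  "chromosome_name", "start_position", "end_position",
  "go_id"
)

# Hypothetical helper: annotate a vector of Ensembl gene IDs.
annotate_genes <- function(gene_ids, mart) {
  biomaRt::getBM(
    attributes = annotation_attributes,
    filters    = "ensembl_gene_id",
    values     = gene_ids,
    mart       = mart
  )
}

# Example usage (needs the biomaRt package and network access to Ensembl):
# mart <- biomaRt::useEnsembl(biomart = "genes",
#                             dataset = "hsapiens_gene_ensembl")
# annotate_genes(c("ENSG00000139618", "ENSG00000141510"), mart)
```

Because a gene can carry many GO terms, the returned data frame is expected to hold one row per gene and GO term combination, not one row per gene.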

Scientific Applications:

  • Genomic annotation: Annotation of genes with symbols, chromosomal coordinates, Gene Ontology terms, and OMIM identifiers.
  • Variant annotation: Retrieval and annotation of SNPs and sequence variation from Ensembl for variant interpretation.
  • Gene expression analysis: Mapping probes and transcripts across platforms to interpret gene expression datasets.
  • Proteomics and cross-omic integration: Linking transcripts to proteins to integrate proteomic and transcriptomic data.
  • Systems biology and pathway exploration: Integrating multi-omic annotations to support systems-level analyses and molecular pathway investigation.
  • Cancer genomics: Querying cancer-related datasets hosted in BioMart resources.
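
Two of the applications above, identifier mapping across gene, transcript, and protein levels and SNP retrieval for a genomic region, might look as follows. This is a hedged sketch: the helper names `map_gene_to_protein` and `snps_in_region`, the example gene symbols, and the example coordinates are illustrative assumptions, and the attribute and filter names should be confirmed with `listAttributes()` and `listFilters()` for the mart in use.

```r
# Hypothetical helper: map HGNC symbols to Ensembl gene, transcript,
# and protein identifiers using the Ensembl genes mart.
map_gene_to_protein <- function(symbols, gene_mart) {
  biomaRt::getBM(
    attributes = c("hgnc_symbol", "ensembl_gene_id",
                   "ensembl_transcript_id", "ensembl_peptide_id"),
    filters    = "hgnc_symbol",
    values     = symbols,
    mart       = gene_mart
  )
}

# Hypothetical helper: list SNPs in a chromosomal interval using the
# Ensembl variation mart. Multiple filters take a list of values,
# one element per filter, in the same order.
snps_in_region <- function(chromosome, start, end, snp_mart) {
  biomaRt::getBM(
    attributes = c("refsnp_id", "allele", "chrom_start", "chrom_strand"),
    filters    = c("chr_name", "start", "end"),
    values     = list(chromosome, start, end),
    mart       = snp_mart
  )
}

# Example usage (needs the biomaRt package and network access to Ensembl):
# gene_mart <- biomaRt::useEnsembl(biomart = "genes",
#                                  dataset = "hsapiens_gene_ensembl")
# map_gene_to_protein(c("BRCA2", "TP53"), gene_mart)
#
# snp_mart <- biomaRt::useEnsembl(biomart = "snps", dataset = "hsapiens_snp")
# snps_in_region(8, 148350, 148420, snp_mart)
```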

Methodology:

The biomaRt Bioconductor package for R retrieves data from BioMart databases (e.g., Ensembl) through BioMart web services. The original publication also describes a direct MySQL query mode.
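
Before building a query, the web-service access path is typically explored by listing the available marts, datasets, attributes, and filters. The sketch below assumes the Ensembl genes mart and the human dataset; the helper name `explore_ensembl` and the default search pattern are hypothetical choices for illustration.

```r
# Hypothetical helper: collect what the Ensembl BioMart service offers.
# Every call below contacts the web service, so network access is required.
explore_ensembl <- function(attribute_pattern = "hgnc") {
  # Connect to the genes mart without choosing a dataset yet.
  ensembl <- biomaRt::useEnsembl(biomart = "genes")

  # Select the human dataset (an assumption; any listed dataset works).
  human <- biomaRt::useDataset("hsapiens_gene_ensembl", mart = ensembl)

  list(
    marts      = biomaRt::listEnsembl(),          # available Ensembl marts
    datasets   = biomaRt::listDatasets(ensembl),  # one row per species dataset
    attributes = biomaRt::searchAttributes(mart = human,
                                           pattern = attribute_pattern),
    filters    = biomaRt::searchFilters(mart = human,
                                        pattern = attribute_pattern)
  )
}

# Example usage:
# overview <- explore_ensembl("hgnc")
# head(overview$attributes)
```

The attribute and filter names found this way are the values passed to the `attributes` and `filters` arguments of `getBM()`.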

Details

License: Artistic-2.0
Maturity: Mature
Cost: Free of charge
Tool Type: command-line tool
Operating Systems: Linux, Windows, Mac
Programming Languages: R
Added: 6/22/2017
Last Updated: 11/24/2024

Publications

Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nature Protocols. 2009;4(8):1184-1191. doi:10.1038/nprot.2009.97. PMID:19617889. PMCID:PMC3159387.

Durinck S, Moreau Y, Kasprzyk A, Davis S, De Moor B, Brazma A, Huber W. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics. 2005;21(16):3439-3440. doi:10.1093/bioinformatics/bti525. PMID:16082012.
