biomaRt
biomaRt provides programmatic access to BioMart databases to retrieve and integrate genomic and proteomic annotations, SNP information, Gene Ontology terms, and OMIM annotations for bioinformatic analyses.
Key Features:
- Integration with Bioconductor: Available as the biomaRt Bioconductor package for R, enabling annotation of gene symbols, chromosomal coordinates, Gene Ontology terms, and OMIM entries.
- Data Retrieval Capabilities: Retrieves genomic sequences and single nucleotide polymorphism (SNP) information from BioMart databases such as Ensembl and supports direct SQL query execution for data access.
- Molecule Mapping and Data Integration: Supports mapping across gene-to-transcript-to-protein relationships and integration of experimental datasets via BioMart web services, enabling linkage of DNA sequence variation with mRNA and protein abundance.
- Dynamic Database Content: Accommodates evolving probe–target relationships in public databases (e.g., Ensembl) within a programmable and reproducible framework for data integration.
Scientific Applications:
- Genomic annotation: Annotation of genes with symbols, chromosomal coordinates, Gene Ontology terms, and OMIM identifiers.
- Variant annotation: Retrieval and annotation of SNPs and sequence variation from Ensembl for variant interpretation.
- Gene expression analysis: Mapping probes and transcripts across platforms to interpret gene expression datasets.
- Proteomics and cross-omic integration: Linking transcripts to proteins to integrate proteomic and transcriptomic data.
- Systems biology and pathway exploration: Integrating multi-omic annotations to support systems-level analyses and molecular pathway investigation.
- Cancer genomics: Querying cancer-related datasets hosted in BioMart resources for cancer genomics studies.
Methodology:
Data access and retrieval are performed via direct SQL queries to BioMart databases (e.g., Ensembl), via BioMart web services, and through the biomaRt Bioconductor package for R.
Topics
Collections
Details
- License:
- Artistic-2.0
- Maturity:
- Mature
- Cost:
- Free of charge
- Tool Type:
- command-line tool
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- R
- Added:
- 6/22/2017
- Last Updated:
- 11/24/2024
Operations
Publications
Rytik PG, et al. [The clinico-pathogenetic characteristics of Kaposi's sarcoma]. Klin Med (Mosk). 1992; 70:19-23.
Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nature Protocols. 2009;4(8):1184-1191. doi:10.1038/nprot.2009.97. PMID:19617889. PMCID:PMC3159387.
Durinck S, Moreau Y, Kasprzyk A, Davis S, De Moor B, Brazma A, Huber W. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics. 2005;21(16):3439-3440. doi:10.1093/bioinformatics/bti525. PMID:16082012.