GEOquery
GEOquery retrieves and parses Gene Expression Omnibus (GEO) datasets into R/Bioconductor objects for analysis of gene expression and related genomic data.
Key Features:
- Access to NCBI GEO: Provides programmatic retrieval of datasets from the Gene Expression Omnibus repository, encompassing nearly 140,000 gene expression datasets across organisms, tissue types, treatment conditions, and disease states.
- Integration with Bioconductor and R: Converts GEO data into Bioconductor-compatible R objects to enable downstream analysis with Bioconductor packages.
- Parsing and formatting: Handles the formatting and parsing of GEO datasets to produce analysis-ready data structures.
- Microarray and genomic data support: Enables application of methods for microarray and genomic data analysis within the Bioconductor ecosystem.
- Support for individual and meta-analysis: Facilitates both single-study analyses and meta-analyses of published gene expression datasets.
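As a minimal sketch of the retrieval and conversion listed above, the R function below wraps GEOquery's `getGEO()` and pulls the expression matrix and sample annotations out of the result with Biobase accessors. The wrapper name `fetch_geo_series` is hypothetical, the accession in the usage comment is only an example, and the code assumes the GEOquery and Biobase packages are installed and that network access is available.

```r
# Hypothetical helper (not part of GEOquery): download one GEO series and
# return its expression matrix and sample annotations.
fetch_geo_series <- function(accession, destdir = tempdir()) {
  # With GSEMatrix = TRUE, getGEO() returns a list of ExpressionSet objects,
  # one per platform used in the series.
  series <- GEOquery::getGEO(accession, GSEMatrix = TRUE, destdir = destdir)
  eset <- series[[1]]  # assumption: the first platform is the one of interest
  list(
    expression = Biobase::exprs(eset),  # features x samples matrix
    samples    = Biobase::pData(eset)   # per-sample metadata
  )
}

# Usage (requires network access; the accession is only an example):
# result <- fetch_geo_series("GSE2553")
# dim(result$expression)
```

Taking only the first list element is a simplification: a series measured on several platforms yields several ExpressionSet objects, and each would need to be handled separately.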
Scientific Applications:
- Microarray data analysis: Analysis of microarray experiment results available in GEO.
- Genomic data analysis: Application of genomic analysis methods to GEO-derived datasets within Bioconductor.
- Meta-analysis of published expression data: Aggregation and combined analysis of multiple GEO studies for meta-analytic inference.
- Comparative studies: Comparative analyses across organisms, tissue types, treatment conditions, and disease states using GEO datasets.
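The aggregation step behind a multi-study analysis can be sketched in base R, under the assumption that each study has already been reduced to an expression matrix whose row names are comparable feature IDs (for example, the same platform or probes already mapped to gene symbols). The helper `combine_studies` and the toy matrices are hypothetical and stand in for matrices extracted from GEO series. This only aligns the data; it is not a statistical meta-analysis and does nothing about batch effects.

```r
# Hypothetical helper: restrict several expression matrices to the feature
# IDs they share and bind their samples into one matrix.
combine_studies <- function(matrices) {
  shared <- Reduce(intersect, lapply(matrices, rownames))
  if (length(shared) == 0) stop("no feature IDs are shared across studies")
  combined <- do.call(
    cbind,
    lapply(matrices, function(m) m[shared, , drop = FALSE])
  )
  # Record which study each sample (column) came from
  study <- rep(seq_along(matrices), times = vapply(matrices, ncol, integer(1)))
  list(expression = combined, study = study)
}

# Toy input standing in for expression matrices from two GEO series
a <- matrix(1:6, nrow = 3, dimnames = list(c("g1", "g2", "g3"), c("a1", "a2")))
b <- matrix(7:10, nrow = 2, dimnames = list(c("g2", "g3"), c("b1", "b2")))
res <- combine_studies(list(a, b))
```

Keeping the `study` vector alongside the combined matrix lets later steps model or correct for study-specific effects.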
Methodology:
GEOquery parses and reformats GEO records into Bioconductor-compatible R objects, which downstream microarray and genomic analyses can then use directly.
Details
- License: GPL-2.0
- Tool Type: command-line tool, library
- Operating Systems: Linux, Windows, Mac
- Programming Languages: R
- Added: 1/17/2017
- Last Updated: 12/24/2018
Publications
Davis S, Meltzer PS. GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics. 2007;23(14):1846-1847. doi:10.1093/bioinformatics/btm254. PMID:17496320.