NCBI Taxonomy Database

The NCBI Taxonomy Database provides standardized nomenclature and taxonomic classification for the organisms represented in the International Nucleotide Sequence Database Collaboration (INSDC: GenBank, ENA/EMBL, DDBJ). It links organism names and taxonomic lineages to nucleotide and protein sequence records to support sequence-based and phylogenetic analyses.


Key Features:

  • Standardized nomenclature and hierarchy: Maintains a curated taxonomic hierarchy and standard organism names for sequence records in INSDC databases (GenBank, ENA/EMBL, DDBJ).
  • Sequence-to-taxon mapping: Associates nucleotide and protein sequence records with organism names and complete taxonomic lineages.
  • Manual curation: Curated by NCBI scientists using current taxonomic literature to maintain up-to-date phylogenetic classification.
  • Integration with NCBI resources: Supplies taxonomic context used by BLAST, Entrez Gene (the successor to LocusLink), PubMed links, and other NCBI tools.
  • Internal Entrez linking and clustering: Allows records to be grouped by taxon and linked internally across the Entrez system and other NCBI domains.
  • External resource linkage: Provides connections from taxa to taxon-specific external web resources.
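
The hierarchy and lineage features above can be pictured as a table of parent pointers, where each taxon records its parent and a lineage is read off by walking up to the root. The following is a minimal sketch under stated assumptions, not NCBI's implementation: the `TAXA` table and the `lineage` function are hypothetical names, and the table is a hand-abbreviated toy that omits most intermediate nodes of the real hierarchy.

```python
# Toy parent-pointer table: tax_id -> (parent_tax_id, rank, scientific_name).
# Assumption: abbreviated by hand for illustration; the real hierarchy has
# many more intermediate nodes between the root and these taxa.
TAXA = {
    1: (1, "no rank", "root"),  # the root is its own parent
    40674: (1, "class", "Mammalia"),
    9443: (40674, "order", "Primates"),
    9604: (9443, "family", "Hominidae"),
    9605: (9604, "genus", "Homo"),
    9606: (9605, "species", "Homo sapiens"),
}


def lineage(tax_id, taxa=TAXA):
    """Return scientific names from the root down to tax_id."""
    path = []
    seen = set()
    while True:
        if tax_id in seen:
            raise ValueError(f"cycle detected at tax_id {tax_id}")
        seen.add(tax_id)
        parent, _rank, name = taxa[tax_id]
        path.append(name)
        if parent == tax_id:  # reached the root
            break
        tax_id = parent
    return path[::-1]  # reverse so the root comes first


if __name__ == "__main__":
    print(" > ".join(lineage(9606)))
```

Storing only the parent link keeps the table compact; the full lineage is derived on demand rather than stored with every sequence record.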

Scientific Applications:

  • Sequence annotation and indexing: Indexes and retrieves nucleotide and protein sequences by taxonomy for genomics and molecular biology workflows.
  • Phylogenetic and comparative analyses: Provides curated taxonomic lineages for phylogenetic classification and comparative studies.
  • Taxonomic context for similarity searches: Supplies organism and lineage information to interpret BLAST and other sequence similarity results.
  • Cross-referencing gene data: Links gene-specific records in Entrez Gene and LocusLink to broader taxonomic information for integrative analyses.
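
One common way to use lineage information when interpreting similarity-search results is to summarize the organisms of a set of hits by their lowest common ancestor. The sketch below illustrates that idea only; it is not a BLAST or NCBI API. The `PARENT` table is a hand-abbreviated toy hierarchy, and `path_to_root` and `lowest_common_ancestor` are hypothetical helper names.

```python
# Toy parent-pointer table: tax_id -> parent tax_id.
# Assumption: abbreviated for illustration; intermediate nodes are omitted.
PARENT = {
    1: 1,          # root (its own parent)
    40674: 1,      # Mammalia
    9443: 40674,   # Primates
    9604: 9443,    # Hominidae
    9605: 9604,    # Homo
    9606: 9605,    # Homo sapiens
    9596: 9604,    # Pan
    9598: 9596,    # Pan troglodytes
    9989: 40674,   # Rodentia
    10066: 9989,   # Muridae
    10088: 10066,  # Mus
    10090: 10088,  # Mus musculus
}


def path_to_root(tax_id, parent=PARENT):
    """Return the tax IDs from tax_id up to and including the root."""
    path = [tax_id]
    while parent[tax_id] != tax_id:
        tax_id = parent[tax_id]
        path.append(tax_id)
    return path


def lowest_common_ancestor(tax_ids, parent=PARENT):
    """Return the deepest taxon shared by the lineages of all tax_ids."""
    tax_ids = list(tax_ids)
    if not tax_ids:
        raise ValueError("at least one tax_id is required")
    # Work with root-first paths and keep only the shared prefix.
    common = path_to_root(tax_ids[0], parent)[::-1]
    for tax_id in tax_ids[1:]:
        other = path_to_root(tax_id, parent)[::-1]
        shared = []
        for a, b in zip(common, other):
            if a != b:
                break
            shared.append(a)
        common = shared
    return common[-1]


if __name__ == "__main__":
    # Hits to human and chimpanzee share the family Hominidae in this toy table.
    print(lowest_common_ancestor([9606, 9598]))
```

Because every path starts at the root, the shared prefix is never empty, so the function always returns a taxon for a non-empty input.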

Methodology:

NCBI scientists manually curate the database from current taxonomic literature, map organism names and taxonomic lineages to INSDC sequence records, and provide internal linking and clustering across the Entrez system.
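
The mapping and clustering described here can be illustrated by attaching a tax ID to each sequence record and grouping records by their ancestor at a chosen rank. This is a hedged sketch, not the Entrez mechanism: the accessions in `RECORDS` are made up, the `NODES` table is an abbreviated toy hierarchy, and `ancestor_at_rank` and `cluster_by_rank` are hypothetical helpers.

```python
from collections import defaultdict

# Toy node table: tax_id -> (parent_tax_id, rank).
# Assumption: abbreviated for illustration; intermediate nodes are omitted.
NODES = {
    1: (1, "no rank"),
    40674: (1, "class"),        # Mammalia
    9443: (40674, "order"),     # Primates
    9604: (9443, "family"),     # Hominidae
    9605: (9604, "genus"),      # Homo
    9606: (9605, "species"),    # Homo sapiens
    9596: (9604, "genus"),      # Pan
    9598: (9596, "species"),    # Pan troglodytes
    9989: (40674, "order"),     # Rodentia
    10066: (9989, "family"),    # Muridae
    10088: (10066, "genus"),    # Mus
    10090: (10088, "species"),  # Mus musculus
}

# Hypothetical sequence records as (accession, tax_id); accessions are invented.
RECORDS = [("SEQ_A", 9606), ("SEQ_B", 9598), ("SEQ_C", 10090), ("SEQ_D", 9606)]


def ancestor_at_rank(tax_id, rank, nodes=NODES):
    """Walk up from tax_id and return the first taxon with the given rank, or None."""
    while True:
        parent, node_rank = nodes[tax_id]
        if node_rank == rank:
            return tax_id
        if parent == tax_id:  # reached the root without finding the rank
            return None
        tax_id = parent


def cluster_by_rank(records, rank, nodes=NODES):
    """Group accessions by the tax ID of their ancestor at the given rank."""
    clusters = defaultdict(list)
    for accession, tax_id in records:
        clusters[ancestor_at_rank(tax_id, rank, nodes)].append(accession)
    return dict(clusters)


if __name__ == "__main__":
    print(cluster_by_rank(RECORDS, "order"))
```

Records whose lineage has no node at the requested rank are collected under the key `None`, which keeps them visible instead of silently dropping them.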

Details

Tool Type: web application
Operating Systems: Linux, Windows, Mac
Added: 3/30/2017
Last Updated: 11/25/2024

Publications

Maglott D. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Research. 2004;33(Database issue):D54-D58. doi:10.1093/nar/gki031. PMID:15608257. PMCID:PMC539985.

Federhen S. The NCBI Taxonomy database. Nucleic Acids Research. 2011;40(D1):D136-D143. doi:10.1093/nar/gkr1178. PMID:22139910. PMCID:PMC3245000.

Wheeler DL. Database resources of the National Center for Biotechnology Information. Nucleic Acids Research. 2000;28(1):10-14. doi:10.1093/nar/28.1.10. PMID:10592169. PMCID:PMC102437.
