BLAST

BLAST identifies regions of local similarity between nucleotide or protein sequences and sequence databases and computes statistical significance of alignments to infer homologous relationships.


Key Features:

  • Local alignment (MSP): Approximates local alignments using the maximal segment pair (MSP) score to optimize local similarity.
  • Sequence types and databases: Compares nucleotide and protein query sequences against sequence databases.
  • Statistical scoring: Calculates statistical significance for alignments to assess the relevance of similarity hits.
  • Heuristic speed: Uses a heuristic approximation that yields roughly an order-of-magnitude speed improvement while maintaining comparable sensitivity.
  • Multiple-region detection: Detects and reports multiple similarity regions within long DNA sequences.
  • Comparative sensitivity: Exhibits lower sensitivity for detecting remote homologs compared with profile-based methods such as CS-BLAST and PHMMER.

Scientific Applications:

  • DNA and protein database searches: Identify homologous sequences by searching nucleotide or protein queries against sequence databases.
  • Motif identification: Detect conserved motifs within queries via local alignments.
  • Gene discovery: Support gene identification and annotation through sequence similarity.
  • Analysis of long sequences: Characterize multiple similarity regions across long DNA sequences.
  • Functional inference: Infer functional relationships between sequences based on alignment similarity and statistical significance.
  • Remote homolog detection (comparative): Detect remote homologs with less sensitivity than profile-based approaches such as CS-BLAST and PHMMER.

Methodology:

Approximates alignments by optimizing local similarity through the maximal segment pair (MSP) score, computes statistical significance of alignments, and employs a heuristic approach that delivers substantial speed improvements while retaining comparable sensitivity.

Topics

Collections

Details

Maturity:
Mature
Cost:
Free of charge
Tool Type:
api, command-line tool, web application
Operating Systems:
Linux, Windows, Mac
Added:
1/13/2017
Last Updated:
11/24/2024

Operations

Data Inputs & Outputs

Sequence alignment

Inputs

Outputs

    Publications

    Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. Journal of Molecular Biology. 1990;215(3):403-410. doi:10.1016/s0022-2836(05)80360-2. PMID:2231712.

    Saripella GV, Sonnhammer ELL, Forslund K. Benchmarking the next generation of homology inference tools. Bioinformatics. 2016;32(17):2636-2641. doi:10.1093/bioinformatics/btw305. PMID:27256311. PMCID:PMC5013910.

    Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden TL. NCBI BLAST: a better web interface. Nucleic Acids Research. 2008;36(Web Server):W5-W9. doi:10.1093/nar/gkn201. PMID:18440982. PMCID:PMC2447716.

    Boratyn GM, Camacho C, Cooper PS, Coulouris G, Fong A, Ma N, Madden TL, Matten WT, McGinnis SD, Merezhuk Y, Raytselis Y, Sayers EW, Tao T, Ye J, Zaretskaya I. BLAST: a more efficient report with usability improvements. Nucleic Acids Research. 2013;41(W1):W29-W33. doi:10.1093/nar/gkt282. PMID:23609542. PMCID:PMC3692093.

    Documentation