Biostrings

Biostrings is an R/Bioconductor package for efficiently storing, manipulating, and searching large biological sequences (DNA, RNA, and amino acid) in the analysis of high-throughput genomic data.


Key Features:

  • Memory Efficiency: String containers designed to hold large-scale biological sequence datasets compactly.
  • String Matching Algorithms: Fast exact and inexact string-matching algorithms for locating sequence patterns, supporting motif searches and sequence alignment.
  • Interoperability: Integrates with other packages in the Bioconductor ecosystem.
  • Rigorous Testing and Review: Subject to formal initial review and continuous automated testing within the Bioconductor framework to ensure reliability.
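
As a rough illustration of the containers and matching functions mentioned above, the following sketch stores a few sequences in a DNAStringSet and searches them for a short pattern. The sequences and variable names are invented for the example, and it assumes Biostrings is already installed; it is not taken from the package documentation.

```r
# Assumes Biostrings is installed (e.g. BiocManager::install("Biostrings")).
library(Biostrings)

# Made-up reads, stored together in one DNAStringSet container.
reads <- DNAStringSet(c(read1 = "ACGTACGT",
                        read2 = "TTTTGGGG",
                        read3 = "GGACGTCC"))
read_widths <- width(reads)                # 8 8 8

# Exact matching of one pattern against a single sequence.
subject <- DNAString("ACGTTGCAACGTACGGTACGT")
hits <- matchPattern("ACGT", subject)
hit_starts <- start(hits)                  # 1 9 18

# Vectorized counting across every sequence in the set.
per_read <- vcountPattern("ACGT", reads)   # 2 0 1
```

matchPattern returns views on the subject, so start(), end(), and width() can be used to locate each hit without copying the matched text.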

Scientific Applications:

  • Sequence Analysis: Manipulation and analysis of DNA, RNA, and protein sequences for genomic research.
  • Pattern Recognition: Identification of motifs and specific sequence patterns relevant to genetic function and regulatory mechanisms.
  • Data Integration: Combining sequence data with other genomic datasets to support comprehensive analyses of biological processes.
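
The kinds of sequence manipulation listed above might look like the following minimal sketch. The input sequence is invented, the variable names are illustrative, and the example assumes Biostrings is installed.

```r
# Assumes Biostrings is installed (e.g. BiocManager::install("Biostrings")).
library(Biostrings)

# A made-up coding sequence: ATG GCC AAG TAA.
dna <- DNAString("ATGGCCAAGTAA")

# Reverse complement of the DNA strand.
rc <- reverseComplement(dna)               # TTACTTGGCCAT

# Convert DNA to RNA (T is replaced by U).
rna <- RNAString(dna)                      # AUGGCCAAGUAA

# Translate with the standard genetic code; "*" marks the stop codon.
protein <- translate(dna)                  # MAK*

# Fraction of G or C letters in the sequence.
gc <- letterFrequency(dna, "GC", as.prob = TRUE)   # 5/12
```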

Methodology:

Implements memory-efficient data structures and string-matching algorithms within R and the Bioconductor framework.
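
One example of a preprocessed matching structure is the pattern dictionary, which lets many patterns be searched in a single pass. The sketch below is only an illustration with invented motifs and an invented subject sequence, assuming Biostrings is installed; it is not meant to describe the package's internals.

```r
# Assumes Biostrings is installed (e.g. BiocManager::install("Biostrings")).
library(Biostrings)

subject <- DNAString("ACGTTGCAACGTACGGTACGT")

# Preprocess a set of equal-width motifs into a pattern dictionary.
motifs <- DNAStringSet(c("ACGT", "GGTA"))
pdict <- PDict(motifs)

# Count exact occurrences of every motif in the subject.
motif_counts <- countPDict(pdict, subject)   # 3 1
```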

Details

License: Artistic-2.0
Cost: Free of charge
Tool Type: command-line tool, library
Operating Systems: Linux, Windows, Mac
Programming Languages: R
Added: 1/17/2017
Last Updated: 11/25/2024

Publications

Huber W, Carey VJ, Gentleman R, Anders S, Carlson M, Carvalho BS, Bravo HC, Davis S, Gatto L, Girke T, Gottardo R, Hahne F, Hansen KD, Irizarry RA, Lawrence M, Love MI, MacDonald J, Obenchain V, Oleś AK, Pagès H, Reyes A, Shannon P, Smyth GK, Tenenbaum D, Waldron L, Morgan M. Orchestrating high-throughput genomic analysis with Bioconductor. Nature Methods. 2015;12(2):115-121. doi:10.1038/nmeth.3252. PMID:25633503. PMCID:PMC4509590.
