Biopython

Biopython provides Python libraries for parsing and writing biological sequence files, performing multiple sequence alignments, manipulating 3D macromolecular structures, querying online biological databases, interfacing with BLAST, ClustalW, and EMBOSS, and applying numerical methods for statistical learning to bioinformatics data.


Key Features:

  • Sequence File Handling: Modules for reading and writing various sequence file formats to support sequence data manipulation.
  • Multiple Sequence Alignments: Support for performing multiple sequence alignment operations used in comparative analyses.
  • 3D Molecular Structures: Capabilities to handle and manipulate 3D macromolecular structures for structural analyses.
  • Integration with Common Tools: Interfaces to interact with BLAST, ClustalW, and EMBOSS for external sequence analysis and alignment tasks.
  • Database Access: Interfaces to access key online biological databases to retrieve genomic and biological data.
  • Statistical Learning Methods: Inclusion of numerical methods for statistical learning to support machine learning analyses in genomics.

Scientific Applications:

  • Comparative Genomics: Multiple sequence alignment functionality supports comparative genomics and evolutionary biology studies.
  • Structural Biology: Handling of 3D macromolecular structures enables structural biology and protein modeling analyses.
  • Sequence Analysis: Sequence file parsing and writing support diverse sequence analysis workflows.
  • Genomic Data Retrieval: Database interfaces enable retrieval and analysis of large-scale genomic datasets.
  • Machine Learning in Genomics: Numerical statistical learning methods facilitate machine learning applications on genomic data.

Methodology:

Parsing and writing of sequence file formats; multiple sequence alignment operations; handling of 3D macromolecular structure data; programmatic interaction with BLAST, ClustalW, and EMBOSS; interfaces to online biological databases; and numerical methods for statistical learning.

Topics

Details

License:
MIT
Maturity:
Mature
Tool Type:
library
Operating Systems:
Linux, Windows, Mac
Programming Languages:
Python
Added:
1/13/2017
Last Updated:
12/10/2018

Operations

Publications

Cock PJ, et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009; 25:1422-3. doi: 10.1093/bioinformatics/btp163

PMID: 19304878

Documentation