LALIGN (EBI)
LALIGN (EBI) identifies internal duplications by calculating non-intersecting local sequence alignments to detect similar regions within biological sequences.
Key Features:
- Local sequence alignment: Performs local alignments to detect regions of similarity within larger sequences without requiring end-to-end alignment.
- Non-intersecting alignments: Reports alignments that do not overlap, enabling clear delineation of distinct duplicated regions.
- Internal duplication detection: Identifies internal duplications within nucleotide or protein sequences as distinct alignment segments.
- EMBL-EBI Job Dispatcher integration: Operates within the EMBL-EBI Job Dispatcher sequence analysis framework.
- Programmatic access: Accessible via RESTful and SOAP-based APIs for automated sequence analysis workflows.
- Data integration: Can be used in conjunction with EMBL-EBI resources such as UniProt, InterPro, ENA, and Ensembl Genomes for sequence data input and interpretation.
Scientific Applications:
- Evolutionary analysis: Detects duplicated regions relevant to studies of gene duplication and molecular evolution.
- Comparative genomics: Identifies internal sequence similarities useful for inter-species and intra-genome comparisons.
- Functional genomics and structural variation: Supports identification of duplicated segments that may affect gene function or genomic structure.
Methodology:
Calculates local sequence alignments constrained to be non-intersecting and executes within the EMBL-EBI Job Dispatcher framework with programmatic access via RESTful and SOAP-based APIs.
Topics
Collections
Details
- Tool Type:
- api, web application
- Operating Systems:
- Linux, Windows, Mac
- Added:
- 1/29/2015
- Last Updated:
- 11/24/2024
Operations
Publications
Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Research. 2022;50(W1):W276-W279. doi:10.1093/nar/gkac240. PMID:35412617. PMCID:PMC9252731.
Pearson WR. Searching protein sequence libraries: Comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms. Genomics. 1991;11(3):635-650. doi:10.1016/0888-7543(91)90071-l.
Madeira F, Madhusoodanan N, Lee J, Eusebi A, Niewielska A, Tivey ARN, Lopez R, Butcher S. The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Research. 2024;52(W1):W521-W525. doi:10.1093/nar/gkae241. PMID:38597606. PMCID:PMC11223882.
Documentation
Downloads
- Source codehttps://github.com/wrpearson/fasta36