UnoSeq
UnoSeq performs expression profiling and gene discovery from Illumina mRNA-seq next-generation sequencing (NGS) data in organisms lacking well-annotated reference genomes.
Key Features:
- Java library implementation: Provided as a Java library for programmatic analysis of NGS expression data.
- Illumina mRNA-seq support: Specifically leverages Illumina mRNA-seq data for transcriptome analysis.
- Novel bioinformatics pipeline: Integrates assembled and annotated sequences from model organisms with information from related species to support analysis without a reference genome.
- Application to CHO cells: Applied to Chinese hamster ovary (CHO) cells, including analysis under butyrate treatment, identifying sequences for over 13,000 genes.
- Novel gene discovery: Added sequence information for approximately 5,000 novel genes to the CHO model.
- Transcript completeness prediction: Predicted more than 6,000 transcript sequences to be complete, covering over 95% of their corresponding mouse orthologs.
- Biological function analysis: Enables analysis of specific functions such as DNA replication and cell cycle control from expression data.
Scientific Applications:
- Expression profiling in non-reference organisms: Facilitates transcriptome characterization where genome or transcriptome annotations are incomplete or absent.
- CHO cell research: Supports gene discovery and expression analysis in CHO cells relevant to recombinant protein production and cell cycle studies.
- Biotechnology and therapeutic protein production: Provides genetic and expression insights applicable to optimization of therapeutic protein expression systems.
Methodology:
Combines Illumina mRNA-seq data with a bioinformatics pipeline that integrates assembled and annotated sequences from related organisms to assemble and annotate transcripts in the absence of a complete reference genome.
Topics
Details
- Tool Type:
- library
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- Java
- Added:
- 1/13/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Birzele F, Schaub J, Rust W, Clemens C, Baum P, Kaufmann H, Weith A, Schulz TW, Hildebrandt T. Into the unknown: expression profiling without genome sequence information in CHO by next generation sequencing. Nucleic Acids Research. 2010;38(12):3999-4010. doi:10.1093/nar/gkq116. PMID:20194116. PMCID:PMC2896516.