UnoSeq

UnoSeq performs expression profiling and gene discovery from Illumina mRNA-seq next-generation sequencing (NGS) data in organisms lacking well-annotated reference genomes.


Key Features:

  • Java library implementation: Provided as a Java library for programmatic analysis of NGS expression data.
  • Illumina mRNA-seq support: Specifically leverages Illumina mRNA-seq data for transcriptome analysis.
  • Novel bioinformatics pipeline: Integrates assembled and annotated sequences from model organisms with information from related species to support analysis without a reference genome.
  • Application to CHO cells: Applied to Chinese hamster ovary (CHO) cells, including analysis under butyrate treatment, identifying sequences for over 13,000 genes.
  • Novel gene discovery: Added sequence information for approximately 5,000 novel genes to the CHO model.
  • Transcript completeness prediction: Predicted more than 6,000 transcript sequences to be complete, covering over 95% of their corresponding mouse orthologs.
  • Biological function analysis: Enables analysis of specific functions such as DNA replication and cell cycle control from expression data.

Scientific Applications:

  • Expression profiling in non-reference organisms: Facilitates transcriptome characterization where genome or transcriptome annotations are incomplete or absent.
  • CHO cell research: Supports gene discovery and expression analysis in CHO cells relevant to recombinant protein production and cell cycle studies.
  • Biotechnology and therapeutic protein production: Provides genetic and expression insights applicable to optimization of therapeutic protein expression systems.

Methodology:

Combines Illumina mRNA-seq data with a bioinformatics pipeline that integrates assembled and annotated sequences from related organisms to assemble and annotate transcripts in the absence of a complete reference genome.

Topics

Details

Tool Type:
library
Operating Systems:
Linux, Windows, Mac
Programming Languages:
Java
Added:
1/13/2017
Last Updated:
11/25/2024

Operations

Publications

Birzele F, Schaub J, Rust W, Clemens C, Baum P, Kaufmann H, Weith A, Schulz TW, Hildebrandt T. Into the unknown: expression profiling without genome sequence information in CHO by next generation sequencing. Nucleic Acids Research. 2010;38(12):3999-4010. doi:10.1093/nar/gkq116. PMID:20194116. PMCID:PMC2896516.