Li Lin, Liu Houbo, Wen Weie, Huang Ceyin, Li Xiaomei, Xiao Shiji, Wu Mingkai, Shi Junhua, Xu Delin
Department of Cell Biology, Zunyi Medical University, Zunyi, China.
School of Pharmacy, Zunyi Medical University, Zunyi, China.
Front Genet. 2020 Oct 15;11:995. doi: 10.3389/fgene.2020.00995. eCollection 2020.
has been widely used in the pharmacology industry. To effectively produce the secondary metabolites through suspension cultured cells of , it is important to exploring the full-length transcriptome data and the genes related to cell growth and chemical producing of all culture stages. We applied a combination of Real-Time Sequencing of Single Molecule (SMRT) and second-generation sequencing (SGS) to generate the complete and full-length transcriptome of suspension cultured cells.
The transcriptome was formed in way by using PacBio isoform sequencing (Iso-Seq) on a pooled RNA sample derived from 23 samples of 10 culture stages, to explore the potential for capturing full-length transcript isoforms. All unigenes were obtained after splicing, assembling, and clustering, and corrected by the SGS results. The obtained unigenes were compared with the databases, and the functions were annotated and classified.
A total of 100,276 high-quality full-length transcripts were obtained, with an average length of 2530 bp and an N50 of 3302 bp. About 52% of total sequences were annotated against the Gene Ontology, 53,316 unigenes were hit by KOG annotations and divided into 26 functional categories, 80,020 unigenes were mapped by KEGG annotations and clustered into 363 pathways. Furthermore, 15,133 long-chain non-coding RNAs (lncRNAs) were detected. And 68,996 coding sequences were identified based on SSR analysis, among which 31 pairs of primers selected at random were amplified and obtained stable bands. In conclusion, our results provide new full-length transcriptome data and genetic resources for identifying growth and metabolism-related genes, which provide a solid foundation for further research on its growth regulation mechanisms and genetic engineering breeding mechanisms of .
已在制药行业广泛应用。为了通过悬浮培养细胞有效生产次生代谢产物,探索所有培养阶段的全长转录组数据以及与细胞生长和化学产物合成相关的基因非常重要。我们应用单分子实时测序(SMRT)和第二代测序(SGS)相结合的方法来生成悬浮培养细胞的完整全长转录组。
通过对来自10个培养阶段的23个样本的混合RNA样本进行PacBio异构体测序(Iso-Seq),以探索捕获全长转录异构体的潜力,从而以这种方式形成转录组。所有单基因在拼接、组装和聚类后获得,并通过SGS结果进行校正。将获得的单基因与数据库进行比较,并对功能进行注释和分类。
共获得100276条高质量全长转录本,平均长度为2530 bp,N50为3302 bp。约52%的总序列通过基因本体论进行注释,53316个单基因被KOG注释命中并分为26个功能类别,80020个单基因通过KEGG注释进行映射并聚类到363条途径中。此外,检测到15133条长链非编码RNA(lncRNA)。基于SSR分析鉴定出68996个编码序列,其中随机选择31对引物进行扩增并获得稳定条带。总之,我们的结果为鉴定生长和代谢相关基因提供了新的全长转录组数据和遗传资源,为进一步研究其生长调控机制和基因工程育种机制奠定了坚实基础。