College of Bee Science and Biomedicine, Fujian Agriculture and Forestry University, Fuzhou 350002, China.
National & Local United Engineering Laboratory of Natural Biotoxin, Fuzhou 350002, China.
Genes (Basel). 2024 Aug 23;15(9):1111. doi: 10.3390/genes15091111.
is a single-cellular fungus that obligately infects the midgut epithelial cells of adult honeybees, causing bee microsporidiosis and jeopardizing bee health and production. This work aims to construct the full-length transcriptome of and conduct a relevant investigation using PacBio single-molecule real-time (SMRT) sequencing technology. Following PacBio SMRT sequencing, 41,950 circular consensus (CCS) were generated, and 25,068 full-length non-chimeric (FLNC) reads were then detected. After polishing, 4387 high-quality, full-length transcripts were gained. There are 778, 2083, 1202, 1559, 1457, 1232, 1702, and 3896 full-length transcripts that could be annotated to COG, GO, KEGG, KOG, Pfam, Swiss-Prot, eggNOG, and Nr databases, respectively. Additionally, 11 alternative splicing (AS) events occurred in 6 genes were identified, including 1 alternative 5' splice-site and 10 intron retention. The structures of 225 annotated genes in the reference genome were optimized, of which 29 genes were extended at both 5' UTR and 3' UTR, while 90 and 106 genes were, respectively, extended at the 5' UTR as well as 3' UTR. Furthermore, a total of 29 high-confidence lncRNAs were obtained, including 12 sense-lncRNAs, 10 lincRNAs, and 7 antisense-lncRNAs. Taken together, the high-quality, full-length transcriptome of was constructed and annotated, the structures of annotated genes in the reference genome were improved, and abundant new genes, transcripts, and lncRNAs were discovered. Findings from this current work offer a valuable resource and a crucial foundation for molecular and omics research on .
是一种单细胞真菌,它专性感染成年蜜蜂的中肠上皮细胞,导致蜜蜂微孢子虫病,危害蜜蜂的健康和生产。本工作旨在构建的全长转录组,并使用 PacBio 单分子实时 (SMRT) 测序技术进行相关研究。PacBio SMRT 测序后,生成了 41950 个圆形一致序列 (CCS),然后检测到 25068 个全长非嵌合 (FLNC) 读段。经过抛光,获得了 4387 条高质量的全长转录本。有 778、2083、1202、1559、1457、1232、1702 和 3896 条全长转录本分别可以注释到 COG、GO、KEGG、KOG、Pfam、Swiss-Prot、eggNOG 和 Nr 数据库。此外,在 6 个基因中鉴定出 11 个剪接事件,包括 1 个可变 5'剪接位点和 10 个内含子保留。225 个注释基因在 参考基因组中的结构得到了优化,其中 29 个基因在 5'UTR 和 3'UTR 两端都得到了扩展,90 个和 106 个基因分别在 5'UTR 和 3'UTR 得到了扩展。此外,共获得 29 条高可信度的 lncRNA,包括 12 条正义 lncRNA、10 条 lincRNA 和 7 条反义 lncRNA。总之,构建并注释了 的高质量全长转录组,改进了 参考基因组中注释基因的结构,发现了大量新的基因、转录本和 lncRNA。本研究为分子和组学研究提供了有价值的资源和重要基础。