Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden.
Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, Sweden.
Physiol Plant. 2024 Sep-Oct;176(5):e14537. doi: 10.1111/ppl.14537.
Long non-coding RNAs (lncRNAs) have emerged as important regulators of many biological processes, although their regulatory roles remain poorly characterized in woody plants, especially in gymnosperms. A major challenge of working with lncRNAs is to assign functional annotations, since they have a low coding potential and low cross-species conservation. We utilised an existing RNA-Sequencing resource and performed short RNA sequencing of somatic embryogenesis developmental stages in Norway spruce (Picea abies L. Karst). We implemented a pipeline to identify lncRNAs located within the intergenic space (lincRNAs) and generated a co-expression network including protein coding, lincRNA and miRNA genes. To assign putative functional annotation, we employed a guilt-by-association approach using the co-expression network and integrated these results with annotation assigned using semantic similarity and co-expression. Moreover, we evaluated the relationship between lincRNAs and miRNAs, and identified which lincRNAs are conserved in other species. We identified lincRNAs with clear evidence of differential expression during somatic embryogenesis and used network connectivity to identify those with the greatest regulatory potential. This work provides the most comprehensive view of lincRNAs in Norway spruce and is the first study to perform global identification of lincRNAs during somatic embryogenesis in conifers. The data have been integrated into the expression visualisation tools at the PlantGenIE.org web resource to enable easy access to the community. This will facilitate the use of the data to address novel questions about the role of lincRNAs in the regulation of embryogenesis and facilitate future comparative genomics studies.
长非编码 RNA(lncRNA)已成为许多生物过程的重要调控因子,尽管它们在木本植物(尤其是裸子植物)中的调控作用仍未得到充分描述。lncRNA 的一个主要挑战是赋予其功能注释,因为它们的编码潜力低且跨物种保守性低。我们利用现有的 RNA 测序资源,对挪威云杉(Picea abies L. Karst)体细胞胚胎发生发育阶段进行了短 RNA 测序。我们实施了一个流程来鉴定位于基因间区的 lncRNA(lincRNA),并生成了一个包含蛋白质编码、lincRNA 和 miRNA 基因的共表达网络。为了赋予可能的功能注释,我们使用共表达网络采用关联罪责的方法,并将这些结果与使用语义相似性和共表达分配的注释进行整合。此外,我们评估了 lincRNA 和 miRNA 之间的关系,并鉴定了在其他物种中保守的 lincRNA。我们鉴定了在体细胞胚胎发生过程中具有明显差异表达的 lincRNA,并利用网络连接性来识别具有最大调控潜力的 lincRNA。这项工作提供了挪威云杉 lincRNA 的最全面视图,也是首次在针叶树中进行体细胞胚胎发生过程中 lincRNA 的全局鉴定。该数据集已整合到 PlantGenIE.org 表达可视化工具中,以便社区能够轻松访问。这将有助于利用这些数据来解决 lincRNA 在胚胎发生调控中的作用的新问题,并促进未来的比较基因组学研究。