State Key Laboratory of Rice Biology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, 310058, China.
State Key Laboratory of Crop Stress Biology in Arid Areas, College of Agronomy and Yangling Branch of China Wheat Improvement Center, Northwest A&F University, Yangling, 712100, China.
Sci China Life Sci. 2018 Feb;61(2):190-198. doi: 10.1007/s11427-017-9174-9. Epub 2017 Oct 31.
Long non-coding RNAs (lncRNAs) are gene regulators that have vital roles in development and adaptation to the environment in eukaryotes. However, the structural and evolutionary analyses of plant lncRNAs are limited. In this study, we performed an analysis of lncRNAs in five monocot and five dicot species. Our results showed that plant lncRNA genes were generally shorter and had fewer exons than protein-coding genes. The numbers of lncRNAs were positively correlated with the numbers of protein-coding genes in different plant species, despite a high range of variation. Sequence conservation analysis showed that the majority of lncRNAs had high sequence conservation at the intra-species and sub-species levels, reminiscent of protein-coding genes. At the inter-species level, a subset of lncRNAs were highly diverged at the nucleotide level, but conserved by position. Interestingly, we found that plant lncRNAs have identical splicing signals, and those which can form precursors or targets of miRNAs have a conservative identity in different species. We also revealed that most of the lowly expressed lncRNAs were tissue-specific, while those highly conserved were constitutively transcribed. Meanwhile, we characterized a subset of rice lncRNAs that were co-expressed with their adjacent protein-coding genes, suggesting they may play cis-regulatory roles. These results will contribute to understanding the biological significance and evolution of lncRNAs in plants.
长非编码 RNA(lncRNA)是真核生物中发育和适应环境的重要基因调控因子。然而,植物 lncRNA 的结构和进化分析受到限制。在本研究中,我们对五种单子叶植物和五种双子叶植物中的 lncRNA 进行了分析。结果表明,与蛋白质编码基因相比,植物 lncRNA 基因通常较短,外显子较少。不同植物物种的 lncRNA 数量与蛋白质编码基因数量呈正相关,尽管存在很大的变异性。序列保守性分析表明,大多数 lncRNA 在种内和亚种水平具有高度的序列保守性,类似于蛋白质编码基因。在种间水平上,一部分 lncRNA 在核苷酸水平上高度分化,但在位置上保守。有趣的是,我们发现植物 lncRNA 具有相同的剪接信号,并且那些能够形成 miRNA 前体或靶标的 lncRNA 在不同物种中具有保守的同一性。我们还揭示了大多数低表达的 lncRNA 是组织特异性的,而那些高度保守的则是组成型转录的。同时,我们对一组水稻 lncRNA 进行了特征描述,这些 lncRNA 与它们相邻的蛋白质编码基因共表达,表明它们可能发挥顺式调控作用。这些结果将有助于理解 lncRNA 在植物中的生物学意义和进化。