Pegueroles Cinta, Gabaldón Toni
Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona, 08003, Spain.
Universitat Pompeu Fabra (UPF), Barcelona, 08003, Spain.
BMC Biol. 2016 Jul 25;14:60. doi: 10.1186/s12915-016-0283-0.
Metazoans transcribe many long non-coding RNAs (lncRNAs) that are poorly conserved and whose function remains unknown. This has raised the questions of what fraction of the predicted lncRNAs is actually functional, and whether selection can effectively constrain lncRNAs in species with small effective population sizes such as human populations.
Here we evaluate signatures of selection in human lncRNAs using inter-specific data and intra-specific comparisons from five major populations, as well as by assessing relationships between sequence variation and predictions of secondary structure. In all analyses we included a reference of functionally characterized lncRNAs. Altogether, our results show compelling evidence of recent purifying selection acting on both characterized and predicted lncRNAs. We found that RNA secondary structure constrains sequence variation in lncRNAs, so that polymorphisms are depleted in paired regions with low accessibility and tend to be neutral with respect to structural stability.
Important implications of our results are that secondary structure plays a role in the functionality of lncRNAs, and that the set of predicted lncRNAs contains a large fraction of functional ones that may play key roles that remain to be discovered.
后生动物转录出许多保守性较差且功能未知的长链非编码RNA(lncRNA)。这引发了以下问题:预测的lncRNA中有多少实际上具有功能,以及在诸如人类群体等有效种群规模较小的物种中,选择是否能有效地限制lncRNA。
在这里,我们使用五个主要群体的种间数据和种内比较,以及通过评估序列变异与二级结构预测之间的关系,来评估人类lncRNA中的选择特征。在所有分析中,我们纳入了功能已明确的lncRNA作为参考。总体而言,我们的结果显示了近期纯化选择作用于已明确和预测的lncRNA的有力证据。我们发现RNA二级结构限制了lncRNA中的序列变异,因此在可及性低的配对区域中多态性减少,并且在结构稳定性方面倾向于呈中性。
我们结果的重要意义在于,二级结构在lncRNA的功能中起作用,并且预测的lncRNA集合中包含很大一部分可能发挥有待发现的关键作用的功能性lncRNA。