Department of Genome Oriented Bioinformatics, Technische Universität München, Wissenschaftzentrum Weihenstephan, Maximus-von-Imhof-Forum 3, D-85354, Freising, Germany.
Nucleic Acids Res. 2012 Feb;40(3):956-62. doi: 10.1093/nar/gkr790. Epub 2011 Sep 27.
It is generally accepted that functionally important RNA structure is more conserved than sequence due to compensatory mutations that may alter the sequence without disrupting the structure. For small RNA molecules sequence-structure relationships are relatively well understood. However, structural bioinformatics of mRNAs is still in its infancy due to a virtual absence of experimental data. This report presents the first quantitative assessment of sequence-structure divergence in the coding regions of mRNA molecules based on recently published transcriptome-wide experimental determination of their base paring patterns. Structural resemblance in paralogous mRNA pairs quickly drops as sequence identity decreases from 100% to 85-90%. Structures of mRNAs sharing sequence identity below roughly 85% are essentially uncorrelated. This outcome is in dramatic contrast to small functional non-coding RNAs where sequence and structure divergence are correlated at very low levels of sequence similarity. The fact that very similar mRNA sequences can have vastly different secondary structures may imply that the particular global shape of base paired elements in coding regions does not play a major role in modulating gene expression and translation efficiency. Apparently, the need to maintain stable three-dimensional structures of encoded proteins places a much higher evolutionary pressure on mRNA sequences than on their RNA structures.
普遍认为,由于可能改变序列而不破坏结构的补偿突变,功能重要的 RNA 结构比序列更保守。对于小 RNA 分子,序列-结构关系相对容易理解。然而,由于缺乏实验数据,mRNA 的结构生物信息学仍处于起步阶段。本报告根据最近发表的基于全转录组实验确定的碱基配对模式,首次对 mRNA 分子编码区的序列-结构差异进行了定量评估。结构相似性在直系同源 mRNA 对中随着序列同一性从 100%下降到 85-90%而迅速下降。序列同一性低于约 85%的 mRNA 结构基本上没有相关性。这一结果与小的功能非编码 RNA 形成鲜明对比,后者在非常低的序列相似性水平上,序列和结构的差异是相关的。非常相似的 mRNA 序列可能具有截然不同的二级结构这一事实表明,编码区碱基配对元件的特定整体形状可能不会在调节基因表达和翻译效率方面发挥主要作用。显然,维持编码蛋白质的稳定三维结构对 mRNA 序列的进化压力比对其 RNA 结构的进化压力要大得多。