Mukhopadhyay Ranjan, Emberly Eldon, Tang Chao, Wingreen Ned S
NEC Laboratories America, Inc., 4 Independence Way, Princeton, New Jersey 08540, USA.
Phys Rev E Stat Nonlin Soft Matter Phys. 2003 Oct;68(4 Pt 1):041904. doi: 10.1103/PhysRevE.68.041904. Epub 2003 Oct 7.
We construct a base-stacking model of RNA secondary-structure formation and use it to study the mapping from sequence to structure. There are strong, qualitative differences between two-letter and four- or six-letter alphabets. With only two kinds of bases, most sequences have many alternative folding configurations and are consequently thermally unstable. Stable ground states are found only for a small set of structures of high designability, i.e., total number of associated sequences. In contrast, sequences made from four bases, as found in nature, or six bases have far fewer competing folding configurations, resulting in a much greater average stability of the ground state.
我们构建了一个RNA二级结构形成的碱基堆积模型,并使用它来研究从序列到结构的映射。双字母和四字母或六字母字母表之间存在显著的定性差异。只有两种碱基时,大多数序列有许多替代折叠构型,因此热稳定性差。仅在一小部分具有高可设计性(即相关序列总数)的结构中发现稳定的基态。相比之下,自然界中发现的由四种碱基或六种碱基组成的序列具有少得多的竞争性折叠构型,导致基态的平均稳定性大大提高。