Statistics of RNA secondary structures.

Author information

Fontana W, Konings D A, Stadler P F, Schuster P

Affiliation

Theoretical Division, Los Alamos National Laboratory, New Mexico 87545.

Publication information

Biopolymers. 1993 Sep;33(9):1389-404. doi: 10.1002/bip.360330909.

PMID:7691201

Abstract

A statistical reference for RNA secondary structures with minimum free energies is computed by folding large ensembles of random RNA sequences. Four nucleotide alphabets are used: two binary alphabets, AU and GC, the biophysical AUGC alphabet, and the synthetic GCXK alphabet. RNA secondary structures are made of structural elements, such as stacks, loops, joints, and free ends. Statistical properties of these elements are computed for small RNA molecules of chain lengths up to 100. The results of RNA structure statistics depend strongly on the particular alphabet chosen. The statistical reference is compared with the data derived from natural RNA molecules with similar base frequencies. Secondary structures are represented as trees. Tree editing provides a quantitative measure for the distance, dt, between two structures. We compute a structure density surface as the conditional probability of two structures having distance t given that their sequences have distance h. This surface indicates that the vast majority of possible minimum free energy secondary structures occur within a fairly small neighborhood of any typical (random) sequence. Correlation lengths for secondary structures in their tree representations are computed from probability densities. They are appropriate measures for the complexity of the sequence-structure relation. The correlation length also provides a quantitative estimate for the mean sensitivity of structures to point mutations.
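The structure density surface described in the abstract is a conditional probability, so it can be estimated from counts. The Python sketch below is only an illustration under stated assumptions, not the authors' procedure: it assumes the sequence distance h is the Hamming distance, and it takes the structure distances t as precomputed inputs, since folding and tree editing are outside its scope. The function names `hamming` and `density_surface` are hypothetical.

```python
from collections import Counter, defaultdict


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ.

    Assumption: the sequence distance h is taken to be the Hamming distance.
    """
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def density_surface(pairs):
    """Estimate P(t | h) from observed (h, t) pairs.

    pairs: iterable of (h, t), where h is a sequence distance and t is a
    structure distance supplied by the caller (hypothetical input; this
    sketch does not compute tree edit distances).
    Returns a nested dict {h: {t: estimated probability}}.
    """
    counts = defaultdict(Counter)
    for h, t in pairs:
        counts[h][t] += 1
    surface = {}
    for h, t_counts in counts.items():
        total = sum(t_counts.values())
        # Normalise the counts at fixed h so that they sum to 1.
        surface[h] = {t: n / total for t, n in t_counts.items()}
    return surface
```

A possible use: call `hamming` on each pair of sequences to get h, pair it with that pair's structure distance t, and pass the resulting list to `density_surface`. Each inner dictionary is then the estimated distribution of structure distances at one fixed sequence distance.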

